INDEX
Explanations
locations or organizations, particularly related to announcements or political events
occurrences of the word "Washington."
New Auto-Interp
Negative Logits
Ep
-0.65
PC
-0.65
Bj
-0.64
Poly
-0.64
Book
-0.62
Moor
-0.62
Nik
-0.61
inf
-0.61
number
-0.61
closed
-0.61
POSITIVE LOGITS
WASHINGTON
4.06
ASHINGTON
2.12
Washington
1.90
SAN
1.53
NEW
1.36
TRUMP
1.35
SHARE
1.35
YORK
1.32
Congress
1.28
LOS
1.28
Activations Density 0.017%