INDEX
Explanations
geographical locations, specifically cities and regions
city names and locations related to news articles
New Auto-Interp
Negative Logits
Gamble
-0.71
Lomb
-0.70
Cipher
-0.69
Loft
-0.68
Lieberman
-0.67
Verb
-0.65
Clapper
-0.65
Birch
-0.64
Priv
-0.63
Staten
-0.63
POSITIVE LOGITS
ION
0.93
IONS
0.90
URA
0.83
CITY
0.81
IVERS
0.80
ENN
0.80
ANA
0.80
ANCE
0.79
ES
0.79
ARY
0.78
Activations Density 0.051%