INDEX
Explanations
locations, especially in the United States and Canada
mentions of U.S. states and cities
New Auto-Interp
Negative Logits
Frankfurt
-0.75
Istanbul
-0.73
Ankara
-0.71
emort
-0.66
Karachi
-0.66
Situation
-0.65
20439
-0.63
Podesta
-0.62
Manila
-0.62
roadside
-0.61
POSITIVE LOGITS
nesota
0.79
sylvania
0.78
fecture
0.68
union
0.67
jee
0.67
Ct
0.64
astern
0.64
Ļ
0.64
notation
0.63
Hampshire
0.63
Activations Density 0.093%