Explanations
mentions of specific geographical locations
Negative Logits
Token        Logit
wcs          -0.74
handshake    -0.70
filib        -0.68
recite       -0.67
ateral       -0.65
clutch       -0.64
ACE          -0.64
edi          -0.62
RM           -0.62
topical      -0.60
Positive Logits
Token          Logit
Janeiro        0.97
Indianapolis   0.95
Atlanta        0.91
Minneapolis    0.88
Seattle        0.86
Naples         0.86
Milwaukee      0.85
Tacoma         0.85
Tulsa          0.84
Cologne        0.84
Activations Density: 0.103%
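
The density figure can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading only; it assumes density means the fraction of positions whose activation exceeds a threshold of zero, which this page does not state. The function name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature is
    active; the source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 1 of 1000 token positions.
sample = [0.0] * 999 + [2.5]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.103% would correspond to roughly one active token position per thousand.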