Explanations
locations or specific geographical regions
Negative Logits
NCT              -0.75
arnaev           -0.71
DonaldTrump      -0.71
PsyNetMessage    -0.67
ASED             -0.66
amins            -0.65
Clause           -0.63
ufact            -0.62
Chatt            -0.62
Polit            -0.61
Positive Logits
pipe             1.00
indal            0.95
ophon            0.85
sey              0.84
church           0.82
ed               0.82
wig              0.82
beck             0.81
bill             0.81
spr              0.81
Activations Density: 0.014%