INDEX
Explanations
references to governmental or political entities, particularly states
New Auto-Interp
Negative Logits
hey
-0.19
andes
-0.19
_states
-0.18
ulk
-0.18
sov
-0.17
å¸Ĥ
-0.16
StateChanged
-0.16
thon
-0.16
th
-0.16
them
-0.16
POSITIVE LOGITS
craft
0.29
hood
0.28
Unidos
0.22
-of
0.21
wide
0.20
/local
0.19
manship
0.19
cipher
0.18
house
0.18
coach
0.18
Activations Density 0.092%