INDEX
Explanations
high-ranking political positions and individuals
references to political figures and titles
New Auto-Interp
Negative Logits
Sensor
-0.68
beh
-0.67
fingert
-0.67
ickle
-0.66
ensitivity
-0.66
radius
-0.65
urat
-0.63
itivity
-0.62
toolbar
-0.61
prelim
-0.61
POSITIVE LOGITS
Yugoslavia
0.85
Desmond
0.77
turned
0.75
Lyndon
0.72
ãĤ¶
0.71
Edward
0.69
Fidel
0.68
Lowell
0.67
oslov
0.67
Olympia
0.67
Activations Density 0.179%