INDEX
Explanations
mentions of political figures, particularly former presidents and vice presidents
references to political figures and their titles
New Auto-Interp
Negative Logits
Sensor
-0.68
atum
-0.64
eta
-0.63
radius
-0.63
âĹ¼
-0.63
fw
-0.62
endi
-0.62
Tree
-0.62
Limited
-0.60
issu
-0.60
POSITIVE LOGITS
Yugoslavia
0.89
Saddam
0.88
Colin
0.79
Lyndon
0.79
turned
0.78
Watergate
0.77
Newt
0.77
Abel
0.77
disgr
0.75
Yugoslav
0.75
Activations Density 0.175%