INDEX
Explanations
words related to political events or actions
instances of the word "office" in relation to political positions
Negative Logits
ostic      -0.74
alog       -0.72
oken       -0.70
essen      -0.69
asca       -0.66
oise       -0.66
Tang       -0.64
Side       -0.63
TERN       -0.62
osphere    -0.62
Positive Logits
holders    0.88
clinton    0.78
yrim       0.72
holder     0.70
earable    0.67
prison     0.64
pledging   0.64
hopeful    0.63
long       0.61
matical    0.61
Activations Density 0.020%