Explanations
names of government officials or titles related to government positions
references to political figures and titles
Negative Logits
Cherokee       -0.62
compartment    -0.58
Kubrick        -0.58
decomp         -0.57
boulder        -0.56
crew           -0.56
nikov          -0.53
ormon          -0.53
robbers        -0.52
forfe          -0.52
Positive Logits
pport          0.67
Coun           0.65
warn           0.65
HUD            0.63
Minister       0.63
liam           0.63
teasp          0.62
oun            0.61
MPs            0.60
imon           0.60
Activations Density: 0.294%