INDEX
Explanations
words related to political entities or movements, such as names of political figures or organizations
variations of the word "em."
New Auto-Interp
Negative Logits
Kubrick
-0.65
bed
-0.64
Rockies
-0.64
SourceFile
-0.63
Toad
-0.60
steroids
-0.56
obsc
-0.56
Walters
-0.56
Mighty
-0.56
Encyclopedia
-0.56
POSITIVE LOGITS
issions
1.23
peror
1.19
achine
1.18
otional
1.14
bley
1.09
bourg
1.08
phasis
1.07
useum
1.07
ple
1.07
igrant
1.05
Activations Density 0.030%