INDEX
Explanations
mentions of the word "dictator" or its variations
terms related to the concept of "dictator" or authoritarian leadership
New Auto-Interp
Negative Logits
ciation
-0.69
Kin
-0.68
bye
-0.68
meric
-0.67
CVE
-0.67
knit
-0.66
lde
-0.64
Shelby
-0.64
Sisters
-0.64
Mid
-0.64
POSITIVE LOGITS
enance
0.99
naire
0.90
rypt
0.83
ory
0.83
ificate
0.82
ures
0.82
ict
0.82
orians
0.82
orian
0.80
о
0.80
Activations Density 0.014%