INDEX
Explanations
references to political dictators
references to dictators and dictatorial regimes
New Auto-Interp
Negative Logits
older
-0.83
awks
-0.83
alle
-0.80
ilk
-0.80
ttp
-0.80
LAN
-0.78
atha
-0.76
Recommend
-0.76
IGH
-0.75
rb
-0.75
POSITIVE LOGITS
dictator
1.34
dictatorship
1.16
dictators
1.00
nomine
0.89
regime
0.85
overth
0.85
regimes
0.84
tyrant
0.82
tyranny
0.82
overthrow
0.81
Activations Density 0.009%