Explanations
words related to authoritarian leadership and governance
Negative Logits
Lent         -0.89
sight        -0.75
ertodd       -0.73
Pokemon      -0.73
Spiel        -0.73
lder         -0.72
Render       -0.72
Universal    -0.72
ificantly    -0.72
furt         -0.71
Positive Logits
regimes        1.05
regime         1.03
overthrow      0.87
dictatorship   0.86
milit          0.85
overth         0.84
imposed        0.82
enforced       0.82
dictators      0.82
dictator       0.81
Activations Density: 0.013%