Explanations
references related to political regimes
references to regimes or authoritarian governments
Negative Logits
Token       Logit
lder        -0.82
Lent        -0.80
Ocean       -0.75
sight       -0.71
Nob         -0.71
Universal   -0.71
hire        -0.70
ritch       -0.68
ertodd      -0.67
Voy         -0.67
POSITIVE LOGITS
Token          Logit
regime         1.02
overthrow      0.95
overth         0.93
regimes        0.91
dictatorship   0.90
Bashar         0.87
dictator       0.85
ollah          0.84
crackdown      0.83
milit          0.80
Activations Density 0.059%