Explanations
references to political regimes
mentions of governmental authority or control, particularly referring to regimes
Negative Logits
Lent        -0.88
sight       -0.78
lder        -0.75
Universal   -0.73
furt        -0.73
ritch       -0.72
Spiel       -0.72
Loch        -0.70
Redd        -0.69
eting       -0.68
Positive Logits
regimes        0.99
regime         0.98
overthrow      0.85
dictatorship   0.83
dictators      0.80
overth         0.80
archs          0.79
dictator       0.79
milit          0.78
imposed        0.76
Activations Density: 0.026%