INDEX
Explanations
phrases related to political regimes
references to authoritarian or oppressive governments
Negative Logits
Redd          -0.80
Kent          -0.75
Kin           -0.72
Ort           -0.72
Constructed   -0.72
Lent          -0.72
Scot          -0.71
Edinburgh     -0.71
ritch         -0.70
Hir           -0.69
Positive Logits
regime         1.36
regimes        1.30
dictator       0.95
dictatorship   0.92
rul            0.92
aution         0.88
administ       0.87
ollah          0.87
governing      0.86
achev          0.86
Activations Density: 0.007%