INDEX
Explanations
people involved in legal or political actions
mentions of independent politicians or activists facing consequences from the government
New Auto-Interp
Negative Logits
Purg
-0.75
reci
-0.66
sear
-0.63
Recovery
-0.62
Surviv
-0.62
whirlwind
-0.59
Horror
-0.59
imon
-0.58
ount
-0.58
Mem
-0.58
POSITIVE LOGITS
transgress
1.11
tresp
1.02
violating
0.98
contra
0.98
dared
0.98
unlawfully
0.95
disobedience
0.94
violations
0.93
illegally
0.93
improperly
0.92
Activations Density 0.693%