INDEX
Explanations
people or groups being punished or prosecuted
references to individuals and groups in the context of speech, expression, and legal consequences
New Auto-Interp
Negative Logits
ggles
-0.74
rawdownloadcloneembedreportprint
-0.74
++++++++++++++++
-0.68
atility
-0.65
Surviv
-0.63
kefeller
-0.63
teamwork
-0.62
Recovery
-0.61
BuyableInstoreAndOnline
-0.61
flashback
-0.59
POSITIVE LOGITS
suspected
1.21
deemed
1.19
accused
1.15
convicted
1.10
whose
1.09
who
1.09
exercising
1.05
violating
1.01
who
0.98
complicit
0.96
Activations Density 0.278%