INDEX
Explanations
references to human rights issues and activism
New Auto-Interp
Negative Logits
HttpFoundation
-0.52
shooter
-0.50
Shooter
-0.48
лан
-0.48
Schn
-0.46
oulder
-0.46
исленность
-0.45
trương
-0.44
shooters
-0.44
shooting
-0.44
POSITIVE LOGITS
human
1.16
Human
1.06
Human
1.04
nahilalakip
1.03
Amnesty
1.03
rights
1.02
human
0.99
rights
0.98
Rights
0.97
HUMAN
0.97
Activations Density 0.138%