INDEX
Explanations
reference to criminal activities and individuals involved in illegal actions
references to criminals and criminality
New Auto-Interp
Negative Logits
HAHA
-0.67
arp
-0.65
urgical
-0.65
Ħ¢
-0.64
chell
-0.64
chron
-0.63
ŃĶ
-0.63
orse
-0.62
yip
-0.62
rolog
-0.62
POSITIVE LOGITS
mastermind
0.95
prey
0.88
gangs
0.88
trafficking
0.81
criminals
0.79
robbing
0.78
offenders
0.78
stealing
0.78
trespass
0.77
smugglers
0.77
Activations Density 0.015%