INDEX
Explanations
elements related to crime and criminal activity
New Auto-Interp
Negative Logits
ujet
-0.17
arger
-0.16
antro
-0.16
/mac
-0.15
ensa
-0.15
deo
-0.15
leton
-0.15
ague
-0.14
midterm
-0.14
opers
-0.14
POSITIVE LOGITS
whose
0.19
UNG
0.15
(es
0.15
Chatt
0.15
whom
0.15
ÏĨα
0.15
whose
0.14
illum
0.14
ess
0.14
amil
0.14
Activations Density 0.359%