INDEX
Explanations
terms related to offenders and offending behavior
New Auto-Interp
Negative Logits
اÙĬÙĨ
-0.16
Propel
-0.16
ëĭĪìķĦ
-0.16
MAS
-0.16
Mas
-0.15
Mas
-0.15
resh
-0.15
etti
-0.14
fait
-0.14
.fn
-0.14
POSITIVE LOGITS
rana
0.17
ischer
0.15
çª
0.14
chan
0.14
h
0.14
ickers
0.14
ucid
0.13
orge
0.13
nap
0.13
rite
0.13
Activations Density 0.007%