INDEX
Explanations
mentions of violent stabbing or stabbing incidents
New Auto-Interp
Negative Logits
DebuggerNonUser
-0.74
webElementGuid
-0.60
AccessorTable
-0.60
opérateurs
-0.60
Públicas
-0.59
économies
-0.59
SourceChecksum
-0.57
unstable
-0.57
oa̍t
-0.56
незавершена
-0.56
POSITIVE LOGITS
stab
3.50
stab
2.55
Stab
2.14
Stab
1.95
stabbed
1.77
stabbing
1.77
捅
0.60
slashes
0.59
Hochspringen
0.57
jab
0.57
Activations Density 0.002%