INDEX
Negative Logits
-major
-0.07
丽
-0.07
Statements
-0.06
.UserID
-0.06
ोख
-0.06
наруж
-0.06
Bik
-0.06
mourn
-0.06
onComplete
-0.06
Fifth
-0.06
POSITIVE LOGITS
trolling
0.07
나는
0.06
joining
0.06
(requestCode
0.06
참
0.06
الل
0.06
$_
0.06
.c
0.05
sell
0.05
oret
0.05
Activations Density 0.223%