INDEX
Explanations
references to legal charges or criminal activities
Negative Logits
Token     Logit
wor       -0.14
æĺŃ       -0.14
à¥ģलन     -0.14
fik       -0.14
imon      -0.14
sue       -0.13
ipop      -0.13
ênh       -0.13
ÑĪиб      -0.13
vor       -0.13
Positive Logits
Token     Logit
charges   0.26
bond      0.24
charge    0.24
warrants  0.24
bond      0.23
charged   0.23
booking   0.22
charging  0.22
warrant   0.21
arrest    0.21
Activations Density 0.093%