INDEX
Explanations
references to incidents involving police reports or crime
New Auto-Interp
Negative Logits
upo
-0.16
acle
-0.15
ardless
-0.14
-urlencoded
-0.14
avana
-0.14
Medal
-0.14
halt
-0.14
Bone
-0.14
UTH
-0.14
ampionship
-0.13
POSITIVE LOGITS
erdem
0.18
Rus
0.15
oret
0.15
Ñģион
0.15
+++
0.14
568
0.14
PageInfo
0.14
Epstein
0.14
ly
0.14
iant
0.14
Activations Density 0.054%