INDEX
Explanations
terms related to police actions and encounters with civilians
Negative Logits
GraphicsUnit    -0.76
+#+#            -0.75
autorytatywna   -0.57
виправивши      -0.55
مواليد          -0.54
ویکیپدیا        -0.54
BoxFit          -0.54
tagext          -0.53
ostavi          -0.52
MouseClicked    -0.52
Positive Logits
police          1.17
law             1.01
authorities     0.85
cops            0.85
security        0.84
police          0.80
polizia         0.79
policemen       0.77
policía         0.73
law             0.70
Activations Density 0.422%