INDEX
Explanations
references to law enforcement entities
mentions of the police
New Auto-Interp
Negative Logits
yip
-0.79
xual
-0.79
arget
-0.78
piring
-0.77
bp
-0.73
vironment
-0.72
arial
-0.72
igible
-0.71
comes
-0.70
ranged
-0.70
POSITIVE LOGITS
officers
1.13
officer
1.07
departments
1.01
commissioner
0.94
brutality
0.94
academy
0.92
Officers
0.92
blot
0.91
chiefs
0.89
Constable
0.86
Activations Density 0.061%