INDEX
Explanations
references to sheriff's offices
references to law enforcement, specifically the term "Sheriff."
New Auto-Interp
Negative Logits
lihood
-0.83
ctory
-0.73
umar
-0.71
aeda
-0.69
auri
-0.67
tar
-0.66
ces
-0.65
raining
-0.65
arial
-0.64
jad
-0.63
POSITIVE LOGITS
Sheriff
1.44
sheriff
1.14
Marshal
1.03
Sher
0.89
Supervisor
0.88
deputies
0.85
Arpaio
0.84
iffs
0.80
osate
0.80
sher
0.79
Activations Density 0.012%