Explanations
police departments (PD) or specific activities related to them
references to police department abbreviations and associations
Negative Logits
hold        -0.76
ppo         -0.75
fee         -0.69
Sphere      -0.69
play        -0.67
Hug         -0.66
sett        -0.66
tem         -0.64
bda         -0.63
borough     -0.62
Positive Logits
PD          0.90
ATES        0.84
illon       0.84
orate       0.80
ective      0.79
iamond      0.78
encies      0.77
ail         0.76
illard      0.74
Constable   0.74
Activations Density: 0.018%