INDEX
Explanations
information related to law enforcement activities
references to police activities or law enforcement actions
Negative Logits
amental       -0.71
ALEC          -0.70
Works         -0.67
TABLE         -0.66
moderation    -0.65
whining       -0.64
ufact         -0.63
lies          -0.62
iscons        -0.62
bench         -0.61
Positive Logits
arrested      1.20
arrests       1.12
suspect       1.06
apprehended   1.04
apprehend     1.03
suspects      1.02
Detect        1.00
arresting     0.99
detained      0.99
nab           0.99
Activations Density 0.200%