INDEX
Explanations
phrases related to police reports, complaints, and criminal activities
references to police reports and legal documents
New Auto-Interp
Negative Logits
warts
-0.85
reddits
-0.84
rities
-0.81
%%
-0.79
obyl
-0.79
<+
-0.76
tics
-0.76
selves
-0.72
Mods
-0.72
ihad
-0.69
POSITIVE LOGITS
affidavit
1.35
probable
1.31
arrest
1.23
affidav
1.11
records
1.01
booking
1.01
statement
1.01
court
0.99
criminal
0.99
transcript
0.99
Activations Density 0.170%