INDEX
Explanations
the word "suspects"
mentions of suspects, particularly those involved in criminal activities
mentions of suspects involved in incidents
Negative Logits
Token         Logit
ntil          -0.77
vironment     -0.70
ammy          -0.67
tem           -0.66
ann           -0.66
abeth         -0.65
psey          -0.65
owan          -0.64
cast          -0.64
legraph       -0.64
Positive Logits
Token         Logit
suspects      1.08
suspect       0.93
mishand       0.73
offenders     0.71
suspected     0.67
istani        0.67
pots          0.66
infring       0.65
perpetrators  0.65
CSI           0.64
Activations Density 0.008%
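
As a rough illustration of what a density figure like this could mean, the sketch below assumes that density is the percentage of tokens on which the feature's activation is above zero. The function name `activation_density`, the zero threshold, and the sample data are hypothetical and are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    is active at all. The dashboard may define it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 8 active tokens out of 100,000.
sample = [0.0] * 99_992 + [1.5] * 8
print(f"{activation_density(sample):.3f}%")
```

Under that assumption, a density of 0.008% would correspond to the feature firing on about 8 tokens in every 100,000, which fits a feature that responds to a narrow set of words such as "suspects".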