INDEX
Explanations
mention of criminal activities and justice-related contexts
New Auto-Interp
Negative Logits
hetically
-0.72
arity
-0.70
Centauri
-0.70
atable
-0.68
natureconservancy
-0.66
hetics
-0.66
por
-0.66
ersed
-0.66
mate
-0.64
DCS
-0.63
POSITIVE LOGITS
spree
1.12
fighting
1.02
fighter
0.99
perpetrated
0.89
fighters
0.89
ridden
0.83
ously
0.82
enforcement
0.82
prevention
0.81
gangs
0.81
Activations Density 0.036%