INDEX
Explanations
words related to violent incidents involving guns
references to police-related shootings and homicides
New Auto-Interp
Negative Logits
ebus
-0.73
Canal
-0.68
coat
-0.64
Bey
-0.63
amina
-0.61
arity
-0.61
ULE
-0.59
LESS
-0.59
tre
-0.58
Hat
-0.56
POSITIVE LOGITS
uggest
1.22
ettings
1.14
hips
1.11
poons
1.08
hip
1.00
paces
0.99
hooting
0.95
mith
0.95
ourcing
0.90
ynthesis
0.88
Activations Density 0.080%