INDEX
Explanations
phrases related to violent incidents, specifically shootings and gunmen
references to shooting incidents or violent events
New Auto-Interp
Negative Logits
GY
-0.76
undai
-0.74
adian
-0.70
eday
-0.69
rian
-0.68
amel
-0.67
hw
-0.67
Label
-0.67
OLOGY
-0.65
anium
-0.65
POSITIVE LOGITS
spree
1.20
rampage
1.10
powder
0.89
nikov
0.88
shootings
0.85
deaths
0.84
massacre
0.81
Shooter
0.81
shooter
0.81
shooting
0.81
Activations Density 0.039%