Explanations
incidents involving gun violence
Negative Logits
Token    Logit
uras     -0.17
Knife    -0.17
Sword    -0.15
质       -0.15
Knife    -0.15
FLT      -0.15
औ        -0.15
swords   -0.15
685      -0.14
Bomb     -0.14
Positive Logits
Token     Logit
shots     0.29
shots     0.26
shot      0.26
shooting  0.24
shoot     0.24
bullet    0.23
firing    0.23
fired     0.23
bullet    0.21
bullets   0.21
Activations Density 0.098%
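
To make the density figure concrete, here is a minimal sketch. It assumes that activation density means the fraction of sampled tokens on which the feature has a nonzero activation. The page does not state this definition, and the function name, the threshold parameter, and the sample data below are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 10 of which activate the feature.
sample = [0.0] * 9990 + [1.5] * 10
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.098% would mean the feature fires on roughly one token in a thousand, which fits a narrow topic such as gun violence.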