INDEX
Explanations
incidents of violent crime involving shootings and stabbings
New Auto-Interp
Negative Logits
uce
-0.16
ró
-0.15
anken
-0.15
ngle
-0.15
idge
-0.14
aket
-0.14
hurt
-0.14
eprom
-0.13
862
-0.13
ient
-0.13
POSITIVE LOGITS
dead
0.18
defense
0.17
hole
0.16
Execution
0.16
execution
0.16
execution
0.16
dead
0.15
while
0.15
whilst
0.15
defence
0.15
Activations Density 0.066%