INDEX
Explanations
references to violent actions involving firearms
occurrences of the word "fire" in various contexts, especially related to violent actions
New Auto-Interp
Negative Logits
Birth
-0.70
nai
-0.70
educated
-0.69
ilings
-0.64
obs
-0.63
inen
-0.62
Brom
-0.62
elsen
-0.61
rome
-0.60
Gi
-0.60
POSITIVE LOGITS
fights
0.88
indiscrim
0.84
sidx
0.83
bomb
0.81
storm
0.79
geoning
0.77
[|
0.76
exting
0.76
fight
0.74
inflicting
0.73
Activations Density 0.026%