INDEX
Explanations
phrases related to fatal events or situations
terms related to fatal incidents and death
New Auto-Interp
Negative Logits
yang
-0.80
erous
-0.79
yi
-0.78
here
-0.76
orthy
-0.75
wright
-0.73
orers
-0.72
arte
-0.70
hey
-0.69
erson
-0.69
POSITIVE LOGITS
overdoses
1.03
flaw
1.02
overdose
1.02
gunshot
1.02
injection
1.01
shootings
1.00
blow
0.99
stabbing
0.94
dose
0.94
wounds
0.93
Activations Density 0.045%