Explanations
words related to injuries or harm, with a focus on casualties
terms related to casualties and fatalities
Negative Logits
perm     -0.80
Gos      -0.67
ramid    -0.66
efer     -0.65
cer      -0.65
hist     -0.65
aired    -0.64
ribed    -0.64
aton     -0.63
nder     -0.62
Positive Logits
casualties    1.36
casualty      1.14
bystanders    0.90
losses        0.82
fatalities    0.80
oided         0.80
Victims       0.77
inflicted     0.77
deaths        0.75
victims       0.75
Activations Density: 0.008%