INDEX
Explanations
phrases related to incidents or events with negative outcomes, such as accidents, deaths, and damage
phrases that describe events resulting in injuries or fatalities
New Auto-Interp
Negative Logits
iths
-0.80
Same
-0.76
atech
-0.74
ith
-0.74
nurture
-0.72
beh
-0.71
wcsstore
-0.71
behaves
-0.68
acy
-0.66
thren
-0.65
POSITIVE LOGITS
deaths
1.14
fatalities
1.13
amput
1.06
death
0.96
hospitalized
0.94
injuring
0.93
ãĥĩãĤ£
0.90
injuries
0.89
casualties
0.88
miscarriage
0.84
Activations Density 0.272%