INDEX
Explanations
instances of physical damage or accidents
events involving injuries or damages
Negative Logits
bern       -0.74
Companies  -0.67
}}}        -0.66
icons      -0.65
nda        -0.64
ocations   -0.63
leground   -0.62
resumes    -0.61
ertain     -0.60
Demand     -0.60
POSITIVE LOGITS
accidentally     1.26
injuring         1.16
unexpectedly     1.05
prematurely      1.04
tragically       1.03
inadvertently    0.98
unintentionally  0.96
injure           0.95
during           0.93
violently        0.90
Activations Density: 0.366%