INDEX
Explanations
mentions of accidents or disasters
references to accidents and casualties
New Auto-Interp
Negative Logits
ophy
-0.72
Intern
-0.69
uchin
-0.66
oliberal
-0.66
utical
-0.64
angel
-0.64
idel
-0.63
butterflies
-0.61
emonic
-0.60
contractual
-0.59
POSITIVE LOGITS
spree
1.05
occurred
1.04
happened
0.97
perpetrated
0.85
sparked
0.82
involving
0.80
stemmed
0.79
victims
0.79
unfold
0.79
unfolded
0.78
Activations Density 0.220%