INDEX
Explanations
instances of mentions of death or dying in various contexts
New Auto-Interp
Negative Logits
orney
-0.93
iola
-0.85
rupulous
-0.84
Allen
-0.82
MN
-0.81
Union
-0.81
arov
-0.80
Ĭ
-0.79
Ľ
-0.78
OPER
-0.77
POSITIVE LOGITS
horribly
1.22
tragically
0.97
toll
0.96
miser
0.95
intest
0.94
ffen
0.93
peacefully
0.92
getic
0.86
prematurely
0.85
psychiat
0.84
Activations Density 8.736%