INDEX
Explanations
references to people passing away, particularly due to illnesses or age
Negative Logits
MN          -0.73
arov        -0.71
orney       -0.70
dan         -0.70
broom       -0.69
iola        -0.67
rupulous    -0.67
ãĥ¼ãĥ       -0.67
GY          -0.64
rouse       -0.64
Positive Logits
tragically      1.12
horribly        1.06
intest          1.03
peacefully      1.02
prematurely     1.00
miser           0.95
mysteriously    0.93
psychiat        0.85
unexpectedly    0.83
ffen            0.80
Activations Density 0.512%