INDEX
Explanations
mentions or references to death
references to death
New Auto-Interp
Negative Logits
properties
-0.75
apan
-0.71
LET
-0.70
ña
-0.68
ibe
-0.66
sb
-0.66
oline
-0.64
lett
-0.64
ï¸ı
-0.63
vouchers
-0.63
POSITIVE LOGITS
dead
3.84
dead
2.68
Dead
2.41
deceased
2.30
DEAD
2.29
Dead
2.11
lifeless
1.69
corpses
1.56
corpse
1.55
dying
1.52
Activations Density 0.019%