INDEX
Explanations
references to death or the deceased
Negative Logits
Macbeth        -0.67
BeginContext   -0.63
hâte           -0.59
meille         -0.56
fratelli       -0.56
veloce         -0.54
hermanos       -0.53
gusto          -0.52
롭             -0.52
kkor           -0.52
Positive Logits
dead           4.39
dead           3.75
Dead           3.67
Dead           3.60
DEAD           3.25
DEAD           2.66
мерт           1.91
muerto         1.78
muertos        1.71
lifeless       1.59
Activations Density: 0.094%