INDEX
Explanations
the word "damned" either in isolation or as part of a compound word
intense emotional expressions and negative sentiments
NEGATIVE LOGITS
Token        Logit
endon        -0.80
Peel         -0.71
oya          -0.68
Kim          -0.67
archy        -0.66
ash          -0.65
Cele         -0.64
ath          -0.63
Kurdistan    -0.63
encer        -0.63
POSITIVE LOGITS
Token        Logit
damned       2.59
raft         1.34
etting       1.32
doomed       1.26
bells        1.04
etter        0.98
detector     0.94
darn         0.92
souls        0.90
uld          0.88
Activations Density: 0.062%