Explanations
words related to positive emotions or satisfaction
instances of satisfaction or positive emotions
Negative Logits
ut           -0.77
onut         -0.74
uli          -0.73
perse        -0.73
ciplinary    -0.71
çīĪ          -0.71
ammy         -0.69
ums          -0.65
ipers        -0.65
gger         -0.64
Positive Logits
pleased      0.71
surprises    0.68
satisfied    0.67
Wynne        0.66
happily      0.66
iated        0.66
enough       0.65
congr        0.63
seeing       0.62
gladly       0.62
Activations Density 0.060%