INDEX
Explanations
phrases expressing strong positive sentiment
expressions of affection or admiration
New Auto-Interp
Negative Logits
merce
-0.77
icter
-0.75
iban
-0.75
ramid
-0.74
reluct
-0.73
ulhu
-0.73
enei
-0.73
interstitial
-0.72
SPONSORED
-0.72
thren
-0.71
POSITIVE LOGITS
birds
1.03
joy
0.89
uncond
0.89
dearly
0.88
fully
0.85
passionately
0.81
bird
0.74
Loving
0.72
LOVE
0.72
tsky
0.72
Activations Density 0.032%