INDEX
Explanations
positive emotions and expressions of happiness
happiness or positive emotions
happiness and optimism
New Auto-Interp
Negative Logits
finesse
-0.45
ご注意
-0.39
save
-0.39
importanza
-0.38
nonUne
-0.38
fascinated
-0.38
않
-0.36
hilangan
-0.36
subtlety
-0.36
itoare
-0.35
POSITIVE LOGITS
positivity
0.80
upbeat
0.77
smile
0.77
Smile
0.77
cheerful
0.77
optimism
0.76
Smile
0.73
optimis
0.73
smiling
0.72
smiles
0.72
Activations Density 0.362%