INDEX
Explanations
expressions related to positive emotions through smiling and grinning
expressions of happiness, particularly smiles and grins
Negative Logits
BI          -0.75
raped       -0.71
ctr         -0.71
aer         -0.70
Administ    -0.70
FER         -0.69
Mant        -0.69
assian      -0.65
æ©Ł         -0.64
ENCY        -0.64
Positive Logits
emot        1.08
broadly     0.92
goodbye     0.91
smile       0.90
emoji       0.89
ys          0.88
radiant     0.88
brightly    0.88
bows        0.83
creen       0.83
Activations Density: 0.050%