INDEX
Explanations
expressions related to emotions and reactions, especially those showing some level of discomfort or disbelief
intense emotional expressions and reactions
New Auto-Interp
Negative Logits
arthed
-0.92
sites
-0.73
ricanes
-0.70
ctr
-0.70
senal
-0.69
Sources
-0.69
é¾į
-0.68
GOODMAN
-0.67
æ©Ł
-0.67
£ı
-0.65
POSITIVE LOGITS
enance
1.15
grin
1.00
demeanor
0.97
smile
0.93
expression
0.91
frown
0.86
smiles
0.85
innocence
0.84
Expression
0.84
laughter
0.83
Activations Density 0.236%