INDEX
Explanations
expressions of strong emotion or dramatic statements
New Auto-Interp
Negative Logits
Beschreibung
-0.31
省市镇
-0.28
상세
-0.28
akt
-0.26
commun
-0.25
Grit
-0.25
是大
-0.25
зонта
-0.24
chapa
-0.24
手
-0.23
POSITIVE LOGITS
joke
0.68
المعيارى
0.68
sarcas
0.68
jokingly
0.67
joking
0.67
Humor
0.67
sarcasm
0.66
AndEndTag
0.65
chuckle
0.65
laughter
0.64
Activations Density 0.065%