INDEX
Explanations
expressions of laughter or humor
laughter and amusement sounds
New Auto-Interp
Negative Logits
kasarigan
-0.73
]")]
-0.73
__':
-0.68
__":
-0.68
"]);
-0.68
AsUp
-0.64
")));
-0.63
+#+#
-0.61
atchewan
-0.60
'));
-0.60
POSITIVE LOGITS
Haha
0.69
haha
0.67
hahaha
0.65
haha
0.63
Hahaha
0.60
Haha
0.60
laughing
0.59
tertawa
0.59
jajaja
0.58
laugh
0.57
Activations Density 0.005%