INDEX
Explanations
expressions of amusement and laughter
New Auto-Interp
Negative Logits
ModelExpression
-0.68
onCancelled
-0.66
('');
-0.66
"/",
-0.63
بيها
-0.61
?";
-0.61
*/;
-0.59
"},
-0.59
"],
-0.59
"):
-0.59
POSITIVE LOGITS
haha
0.78
lol
0.76
LOL
0.74
LOL
0.72
hahaha
0.71
Haha
0.71
HAHAHAHA
0.71
HAHA
0.70
lol
0.67
Hahaha
0.66
Activations Density 0.147%