Explanations
social interactions characterized by humor and playful banter
Humor and jokes
joking and pranks
Negative Logits
ngdoc           -0.35
invokingState   -0.29
']))            -0.28
esfuerzos       -0.28
miraculously    -0.27
maleta          -0.27
desperately     -0.26
าหลี            -0.26
(blank)         -0.25
opinión         -0.25
Positive Logits
teasing         0.77
tease           0.69
prank           0.68
pranks          0.66
joke            0.65
mischie         0.65
laugh           0.63
joking          0.61
teased          0.60
mocking         0.60
Activations Density 0.225%