INDEX
Explanations
references to stand-up comedy and the experiences of comedians
Negative Logits
Token             Logit
uros              -0.16
hani              -0.15
chor              -0.15
.scalablytyped    -0.14
Junk              -0.14
shint             -0.14
ibbon             -0.14
avic              -0.14
opera             -0.13
ienes             -0.13
Positive Logits
Token             Logit
Comedy            0.39
comedy            0.37
stand             0.32
comedian          0.31
comed             0.29
comedic           0.25
Stand             0.24
jokes             0.24
漫                0.22
stand             0.21
Activations Density: 0.192%