Explanations
references to comedy or comedic content
references to comedy and comedic content
Negative Logits
ignty       -0.76
hips        -0.74
eus         -0.71
violet      -0.70
ports       -0.69
ipped       -0.68
urion       -0.66
EMS         -0.66
heed        -0.66
tnc         -0.65
Positive Logits
improvis    0.90
sketches    0.89
comedy      0.88
Schumer     0.87
comedian    0.87
comedians   0.83
Comedy      0.83
Comed       0.82
roast       0.80
improv      0.80
Activations Density 0.060%