INDEX
Explanations
words related to humor or comedic elements
references to humor and comedic elements
New Auto-Interp
Negative Logits
holder
-0.68
arnaev
-0.67
FT
-0.62
Ag
-0.60
minster
-0.59
Peaks
-0.58
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.57
opio
-0.57
eyed
-0.57
REAM
-0.57
POSITIVE LOGITS
ously
1.29
humour
0.95
humor
0.94
ably
0.91
ingly
0.81
lessly
0.81
netflix
0.80
isma
0.76
atur
0.76
osity
0.75
Activations Density 0.017%