INDEX
Explanations
instances of words related to humor
references to humor or comedic elements
New Auto-Interp
Negative Logits
holder
-0.74
arnaev
-0.68
minster
-0.62
opio
-0.62
eded
-0.61
holders
-0.60
Countries
-0.60
ioch
-0.60
ignty
-0.60
FT
-0.60
POSITIVE LOGITS
ously
1.20
humor
0.90
humour
0.88
netflix
0.87
ably
0.79
isma
0.78
osity
0.76
aceous
0.74
satir
0.74
mocking
0.74
Activations Density 0.025%