INDEX
Explanations
phrases related to joking or humor
instances of humor and speculation in the text
New Auto-Interp
Negative Logits
waters
-0.73
edom
-0.71
enf
-0.69
ãĥ¡
-0.69
BO
-0.68
por
-0.68
ĪĴ
-0.66
water
-0.66
orneys
-0.62
ebook
-0.61
POSITIVE LOGITS
sarcast
0.81
omin
0.79
aloud
0.73
llor
0.67
angrily
0.65
gloom
0.63
mourn
0.63
Wilde
0.63
aults
0.63
atively
0.63
Activations Density 0.052%