INDEX
Explanations
phrases that end with a witty or clever one-liner
humorous elements or comedic expressions in dialogue
Negative Logits
Token          Logit
MFT            -0.79
ranch          -0.79
Horizons       -0.74
osponsors      -0.71
agate          -0.70
GOODMAN        -0.67
Branch         -0.64
Located        -0.63
biodiversity   -0.63
Sovere         -0.63
Positive Logits
Token          Logit
uttered        1.20
aloud          1.20
inco           1.15
sarcast        1.14
unint          1.10
gib            1.07
("             1.07
apologizing    1.03
politely       0.97
implying       0.96
Activations Density 0.478%