INDEX
Explanations
sentences that start with "You know," or similar phrases
repetitive phrases that initiate with "You know."
New Auto-Interp
Negative Logits
士
-0.80
omal
-0.76
aq
-0.75
entials
-0.74
erity
-0.73
pak
-0.71
rehend
-0.70
ãĤº
-0.70
uscript
-0.69
HL
-0.67
POSITIVE LOGITS
uh
0.80
maybe
0.79
kinda
0.72
anecd
0.71
depending
0.70
sensing
0.69
sort
0.65
whatever
0.65
soType
0.64
yeah
0.64
Activations Density 0.051%