INDEX
Explanations
the word "know"
the phrase "you know" as a discourse marker
Negative Logits
utenberg    -0.85
anmar       -0.77
stad        -0.75
omal        -0.71
iets        -0.71
upe         -0.70
otom        -0.69
cius        -0.68
ermanent    -0.66
aez         -0.65
Positive Logits
lege        0.81
ledged      0.80
ledge       0.74
terday      0.73
exactly     0.66
abella      0.66
how         0.65
KNOW        0.64
guessed     0.64
whats       0.63
Activations Density: 0.033%