INDEX
Explanations
phrases indicating a transition to a new topic or idea
references to progression or transitions in ideas or topics
New Auto-Interp
Negative Logits
she
-0.70
ricular
-0.65
quer
-0.63
anas
-0.58
gad
-0.57
stru
-0.57
sylv
-0.56
ryn
-0.56
vict
-0.56
orld
-0.54
POSITIVE LOGITS
nicely
0.90
us
0.80
me
0.77
why
0.76
neatly
0.75
Canaver
0.75
begs
0.75
inently
0.74
Patreon
0.68
Birch
0.67
Activations Density 0.238%