INDEX
Explanations
words that signal transitions or comparisons in text
after a period
phrases referring to past, present, or future
New Auto-Interp
Negative Logits
Majefty
-0.91
Efq
-0.90
―――――
-0.86
complexContent
-0.85
herself
-0.83
Houſe
-0.82
Shakspeare
-0.81
་་
-0.81
itſelf
-0.79
EndContext
-0.79
POSITIVE LOGITS
it
1.26
we
1.25
there
1.07
they
1.03
you
0.97
,
0.96
I
0.96
this
0.86
the
0.83
he
0.72
Activations Density 0.397%