Explanations
the second-person pronoun "you"
Negative Logits
Token       Logit
myſelf      -1.28
Jefus       -1.26
ſtre        -1.21
ſta         -1.18
ſever       -1.18
faſt        -1.17
pleaſure    -1.17
leſs        -1.16
itſelf      -1.16
ſeveral     -1.15
Positive Logits
Token       Logit
you         0.88
it          0.84
I           0.81
they        0.71
us          0.69
It          0.69
them        0.64
,           0.62
that        0.62
he          0.61
Activations Density: 0.221%