INDEX
Explanations
repetitive mentions of the second person pronoun "you"
you followed by auxiliary verbs
New Auto-Interp
Negative Logits
<eos>
-0.34
oeil
-0.32
$=$
-0.32
McE
-0.30
&
-0.30
lendemain
-0.29
ậc
-0.27
itself
-0.27
McEl
-0.27
<h2>
-0.26
POSITIVE LOGITS
<unused74>
0.98
<unused41>
0.98
[@BOS@]
0.98
<pad>
0.98
<unused43>
0.98
<unused14>
0.98
<unused28>
0.98
<unused16>
0.98
<unused3>
0.98
<unused8>
0.98
Activations Density 0.022%