INDEX
Explanations
the repetition of the word "you" in various contexts
New Auto-Interp
Negative Logits
KN
-0.18
enge
-0.16
ÙĥÙĪÙĨ
-0.15
instein
-0.14
uron
-0.14
KN
-0.14
Prem
-0.14
auses
-0.14
oux
-0.14
glas
-0.14
POSITIVE LOGITS
reich
0.15
Clark
0.14
array
0.14
='".$_
0.14
ady
0.14
онов
0.14
Covered
0.13
ìĬ¤íĨł
0.13
ιÏĥÏĦο
0.13
Higgins
0.13
Activations Density 0.036%