INDEX
Explanations
instances of the letter "l" in the text
Negative Logits
oi       -0.18
onis     -0.16
Kral     -0.15
ijk      -0.15
ongo     -0.15
ex       -0.14
ink      -0.14
overe    -0.14
iefs     -0.14
hoe      -0.14
Positive Logits
l            0.35
ager         0.19
ichen        0.18
urch         0.18
amination    0.18
*l           0.17
ags          0.17
=l           0.17
agoon        0.17
:l           0.16
Activations Density: 0.021%