INDEX
Explanations
occurrences of the letter 'y' in various contexts
New Auto-Interp
Negative Logits
i
-0.32
o
-0.31
a
-0.29
r
-0.27
t
-0.24
Axis
-0.20
e
-0.20
olated
-0.20
y
-0.20
auss
-0.20
POSITIVE LOGITS
achts
0.20
tics
0.17
á»ĥm
0.17
ea
0.17
nothrow
0.16
outu
0.16
ernel
0.16
oked
0.16
ester
0.16
amaha
0.15
Activations Density 0.061%