INDEX
Explanations
instances of the letter 'y' in various contexts
Negative Logits
t    -0.28
ãĥ³    -0.28
yı    -0.28
on    -0.26
à¸ģ    -0.26
o    -0.25
ar    -0.25
i    -0.25
k    -0.24
h    -0.23
Positive Logits
اÙģØªÙĩ    0.16
ãĤĭãģ¨    0.16
blo    0.16
inou    0.16
asl    0.15
edx    0.15
dess    0.15
elib    0.15
timeofday    0.15
ponge    0.15
Activations Density: 0.061%