INDEX
Explanations
references to the letter "Y" in various contexts
Negative Logits
ç¼ĺ        -0.18
inoa       -0.15
aled       -0.14
alement    -0.14
quota      -0.14
ceptar     -0.14
Threads    -0.14
ear        -0.14
Kron       -0.14
Cup        -0.14
Positive Logits
gles       0.22
bane       0.20
dens       0.20
ilm        0.20
bar        0.19
phant      0.18
eam        0.18
zag        0.18
ank        0.17
unker      0.17
Activations Density 0.015%