INDEX
Explanations
variations of the letter 'k' in different contexts
Negative Logits
outines     -0.16
871         -0.14
letes       -0.14
ford        -0.14
ium         -0.14
ainers      -0.14
_UNUSED     -0.14
اÙĩد        -0.14
ice         -0.14
ician       -0.14
Positive Logits
inks        0.22
k           0.22
iosk        0.21
appa        0.17
essenger    0.17
:k          0.16
INET        0.16
nowledge    0.16
*k          0.16
cen         0.15
Activations Density 0.050%
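The density figure above is most naturally read as the share of token positions on which this feature fires. The sketch below illustrates that reading only. It assumes density means "fraction of activations above a threshold"; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions on which the
    feature is active. The threshold of 0.0 is illustrative.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 1 of 2000 token positions.
sample = [0.0] * 1999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.050%"
```

Under this reading, a density of 0.050% corresponds to roughly one active position per 2000 tokens, which fits a sparse feature that responds mainly to 'k'-related tokens.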