INDEX
Explanations
the presence of the word "Ky" in various contexts
Negative Logits
chip      -0.17
onica     -0.15
apur      -0.15
ensch     -0.15
.EXTRA    -0.14
Ĩ         -0.14
yy        -0.14
ascus     -0.14
eters     -0.14
aupt      -0.14
Positive Logits
rgyz      0.26
Ky        0.20
rie       0.20
riad      0.19
OTO       0.19
oto       0.18
ogle      0.18
ky        0.17
ung       0.17
Ky        0.17
Activations Density 0.006%