Explanations
instances of the letter "O" in various forms
Negative Logits
nya         -0.20
criptor     -0.19
pas         -0.19
ny          -0.18
ne          -0.17
rist        -0.16
arn         -0.16
naissance   -0.16
nea         -0.16
no          -0.16
Positive Logits
aths    0.17
υχ      0.17
key     0.16
اخر     0.16
Mane    0.16
embed   0.16
en      0.16
قات     0.15
asics   0.15
ettel   0.15
Activations Density 0.097%