Explanations
singular indefinite articles and their variations in text
Negative Logits
aly       -0.16
ãĥªãĤ«    -0.15
ioc       -0.15
ario      -0.14
ped       -0.14
vil       -0.14
kbd       -0.14
Ù쨩       -0.14
king      -0.14
дÑĭ       -0.14
Positive Logits
Outlet    0.19
onu       0.18
serter    0.16
akter     0.16
alone     0.16
heits     0.15
myfile    0.14
zelf      0.14
iras      0.14
asto      0.14
Activations Density 0.010%