INDEX
Explanations
non-Latin characters or possibly encoded text
New Auto-Interp
Negative Logits
"<?
-0.16
ź
-0.15
dn
-0.14
ç´Ģ
-0.14
Cush
-0.14
.Toolkit
-0.14
Į¨
-0.14
обÑĭ
-0.14
explos
-0.14
robe
-0.14
POSITIVE LOGITS
Bulgarian
0.30
Bulgaria
0.29
Sofia
0.27
elerik
0.22
Telerik
0.22
Bulg
0.22
Dimit
0.19
Burg
0.18
بÙĦغ
0.17
Macedonia
0.17
Activations Density 0.093%