Explanations
numeric references and bibliographic citations
Negative Logits
ailer    -0.17
arov     -0.16
inkel    -0.16
oidal    -0.16
acker    -0.16
atra     -0.15
arken    -0.15
anger    -0.15
ellow    -0.15
dál      -0.15
Positive Logits
隆       0.14
Pere     0.14
orz      0.14
Silver   0.14
زام      0.14
दब       0.14
etat     0.13
ابت      0.13
berapa   0.13
Mug      0.13
Activations Density: 0.013%