INDEX
Explanations
parts of formal bibliographic citations
New Auto-Interp
Negative Logits
alen
-0.16
usters
-0.15
orsi
-0.15
itom
-0.15
andr
-0.15
iena
-0.15
ìĤ´
-0.14
osta
-0.14
ursal
-0.14
.chars
-0.14
POSITIVE LOGITS
ktop
0.17
↵↵
0.16
MOT
0.14
га
0.14
ouri
0.14
ga
0.14
ebo
0.14
Bah
0.14
ARG
0.14
è·
0.13
Activations Density 0.157%