Explanations
definite articles and references to organizations or formal groups
Negative Logits
vida      -0.15
ichi      -0.15
lea       -0.15
ufen      -0.15
šť        -0.15
mina      -0.14
sdale     -0.14
ж         -0.14
cura      -0.14
haft      -0.14
Positive Logits
441       0.15
adam      0.14
ions      0.14
Stacy     0.14
347       0.14
instead   0.14
afari     0.14
/single   0.13
Stuart    0.13
ucc       0.13
Activations Density: 0.033%