INDEX
Explanations
references to language learning and bilingualism
New Auto-Interp
Negative Logits
oa
-0.16
ymes
-0.14
asma
-0.14
oxy
-0.14
oval
-0.14
Levine
-0.14
ãĥ¼ãĥĹ
-0.14
loyment
-0.14
ansson
-0.14
ife
-0.14
POSITIVE LOGITS
emiz
0.16
Barbar
0.16
_singleton
0.16
enÃŃ
0.15
ahr
0.15
atural
0.14
बल
0.14
rowable
0.13
abra
0.13
_PAIR
0.13
Activations Density 0.461%