Explanations
references to grammatical structures and language learning concepts
Negative Logits
Token        Logit
slang        -0.15
umd          -0.15
lint         -0.15
å¹ķ          -0.15
synonym      -0.14
ocabulary    -0.14
bul          -0.14
ewire        -0.14
(||          -0.14
ndl          -0.14
Positive Logits
Token        Logit
accus        0.21
morph        0.20
morphology   0.20
declined     0.20
inf          0.20
endings      0.20
Morph        0.19
decl         0.19
agreement    0.18
Case         0.18
Activations Density: 0.024%