INDEX
Explanations
articles and determiners indicating quantity or degree
New Auto-Interp
Negative Logits
eb
-0.16
ewire
-0.15
.Predicate
-0.14
ulfilled
-0.14
ä¹İ
-0.14
.dds
-0.13
ehen
-0.13
оÑĢаз
-0.13
egin
-0.13
uno
-0.13
POSITIVE LOGITS
ATUS
0.15
alic
0.14
çĵľ
0.14
unker
0.14
ekk
0.14
ìĹŃ
0.13
vais
0.13
tendency
0.13
IRON
0.13
목
0.13
Activations Density 0.105%