INDEX
Explanations
references to scholarly work or citations
New Auto-Interp
Negative Logits
///<
-0.17
ä
-0.15
.communic
-0.15
adil
-0.15
unan
-0.14
eea
-0.14
pret
-0.14
oster
-0.14
isi
-0.14
undef
-0.14
POSITIVE LOGITS
Paid
0.18
Fluent
0.16
AVL
0.15
ANTE
0.15
Lakes
0.15
åĨĮ
0.15
chos
0.15
xea
0.15
Sweat
0.14
Await
0.14
Activations Density 0.067%