INDEX
Explanations
statements regarding change and its implications
New Auto-Interp
Negative Logits
deo
-0.15
conserv
-0.15
agina
-0.15
antas
-0.14
-notification
-0.14
accession
-0.14
Conservation
-0.14
Caval
-0.14
iki
-0.13
repr
-0.13
POSITIVE LOGITS
ÑĢаÑģÑĤ
0.17
eba
0.16
apus
0.14
Ľå»º
0.14
dül
0.14
cü
0.14
mij
0.14
üt
0.14
TRUE
0.14
clare
0.13
Activations Density 0.078%