Explanations
references to comparison and contrast between different subjects or items
Negative Logits
isor      -0.16
ignum     -0.15
ant       -0.14
ixa       -0.14
ῶ         -0.14
anganese  -0.14
é±        -0.14
oline     -0.14
innoc     -0.13
lif       -0.13
Positive Logits
others    0.18
others    0.16
wor       0.16
سایر      0.16
other     0.15
otras     0.15
’autres   0.15
Others    0.15
'autres   0.15
Others    0.15
Activations Density 0.309%