INDEX
Explanations
references to comparisons and contrasts between different entities or situations
New Auto-Interp
Negative Logits
Efq
-0.85
оригіналу
-0.73
はじめに
-0.71
ainfi
-0.70
Monfieur
-0.68
odotus
-0.68
Theſe
-0.68
tanleria
-0.68
Administrativna
-0.66
chi̍t
-0.66
POSITIVE LOGITS
than
0.88
Other
0.75
Other
0.75
besides
0.69
besides
0.61
other
0.61
other
0.61
Others
0.60
Otras
0.59
niż
0.59
Activations Density 0.720%