INDEX
Explanations
instructions regarding health and safety precautions
New Auto-Interp
Negative Logits
Various
-0.17
enco
-0.15
немного
-0.15
anki
-0.14
aira
-0.14
various
-0.14
åIJĦ
-0.14
è³¢
-0.14
Various
-0.13
meldung
-0.13
POSITIVE LOGITS
unless
0.33
unless
0.28
anything
0.28
directly
0.27
Unless
0.26
ANY
0.25
anything
0.24
Unless
0.24
EVER
0.23
ä»»ä½ķ
0.23
Activations Density 0.318%