INDEX
Explanations
phrases that denote structure and organization in a formal context
New Auto-Interp
Negative Logits
aber
-0.16
esso
-0.15
ataire
-0.15
elman
-0.15
undred
-0.14
ãĥ¨
-0.14
ness
-0.13
,:,
-0.13
.compat
-0.13
anst
-0.13
POSITIVE LOGITS
actual
0.23
identical
0.23
usa
0.22
majority
0.21
particular
0.21
Usa
0.19
sunday
0.19
applying
0.19
actual
0.18
potency
0.18
Activations Density 0.040%