INDEX
Explanations
adjectives or descriptors related to qualities or characteristics
New Auto-Interp
Negative Logits
al
-0.76
is
-0.74
can
-0.70
-
-0.69
only
-0.66
was
-0.65
uğ
-0.64
or
-0.64
about
-0.63
n
-0.63
POSITIVE LOGITS
Italijanski
1.32
estekak
1.23
EDEFAULT
1.16
Мексичка
1.14
'\\;'
1.14
дописавши
1.13
)";
1.10
}")
1.08
RenderAtEndOf
1.08
Himo
1.08
Activations Density 0.022%