INDEX
Explanations
terms related to duration and magnitude
New Auto-Interp
Negative Logits
,
-0.61
-0.60
.
-0.59
to
-0.57
the
-0.56
a
-0.56
for
-0.56
$
-0.55
↵↵
-0.53
}
-0.53
POSITIVE LOGITS
autorytatywna
1.49
########.
1.26
photolibrary
1.23
ModelExpression
1.19
―――――
1.15
GEBURTS
1.13
#+#
1.13
שוליים
1.09
мәкал
1.08
itſelf
1.08
Activations Density 0.338%