INDEX
Explanations
terms related to scientific measurement and quantification
New Auto-Interp
Negative Logits
myſelf
-1.15
themſelves
-1.12
UnusedPrivate
-1.11
pleaſure
-1.10
RenderAtEndOf
-1.06
ſelves
-1.05
ſelf
-1.05
ſeveral
-1.04
Reſ
-1.04
ſever
-1.04
POSITIVE LOGITS
0.58
,
0.58
that
0.53
as
0.47
on
0.47
to
0.45
(
0.44
D
0.44
e
0.43
.
0.43
Activations Density 1.537%