INDEX
Explanations
references to periods of time, particularly those involving years and weeks
Negative Logits
Token    Logit
of       -0.81
is       -0.60
to       -0.56
rius     -0.46
in       -0.44
are      -0.42
with     -0.42
+        -0.41
=        -0.41
there    -0.41
Positive Logits
Token     Logit
pleaſure  1.26
faſt      1.21
houſe     1.18
purpoſe   1.17
ſmall     1.16
")));     1.15
Diſ       1.10
ſever     1.09
ſche      1.09
NUMX      1.09
Activations Density: 0.174%
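
An activation density figure like this is commonly read as the fraction of sampled tokens on which the feature fires at all. The sketch below assumes that definition. The function name, the zero threshold, and the sample counts are hypothetical and are not taken from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens with a
    non-zero activation. The dashboard does not state its exact definition.
    """
    if len(activations) == 0:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 174 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 174 + [0.0] * (100_000 - 174)
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.174%
```

A density this low means the feature is sparse: under the assumed definition it would be silent on more than 99.8% of tokens.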