INDEX
Explanations
extremely common words like "the", "a", "in", "it", "my", and "that" and the word "something"
something
New Auto-Interp
Negative Logits
ſche
-2.45
pleaſure
-2.36
purpoſe
-2.30
Monfieur
-2.28
Cæsar
-2.17
quæ
-2.17
Majefty
-2.16
itſelf
-2.13
Diſ
-2.09
ſever
-2.09
POSITIVE LOGITS
ratulations
2.17
"):
1.96
—
1.87
'):
1.86
æus
1.84
')):
1.83
ficulty
1.81
iffance
1.80
polation
1.80
"])
1.77
Activations Density 6.289%