Explanations
mathematical symbols and notation
Negative Logits
myſelf      -1.34
Efq         -1.17
pleaſure    -1.15
Majefty     -1.15
fhew        -1.14
itſelf      -1.14
raiſ        -1.13
chofe       -1.10
Chriftian   -1.09
Jefus       -1.08
Positive Logits
            0.58
“           0.57
;           0.50
(           0.50
The         0.49
the         0.48
↵           0.48
.           0.48
(           0.47
’           0.47
Activations Density 0.308%