INDEX
Explanations
specialized notations and formatting used in scientific or technical documents
New Auto-Interp
Negative Logits
e
-1.17
(
-1.09
ه
-1.07
y
-1.03
E
-1.03
u
-1.03
en
-1.03
X
-1.01
t
-1.00
X
-1.00
POSITIVE LOGITS
myſelf
2.43
Theſe
2.35
itſelf
2.25
Efq
2.25
themſelves
2.18
Anſ
2.18
himſelf
2.17
whoſe
2.04
raiſ
2.04
Reſ
2.04
Activations Density 0.874%