INDEX
Explanations
structured data syntax or programming constructs
New Auto-Interp
Negative Logits
,
-0.80
N
-0.66
de
-0.64
-0.64
—
-0.63
else
-0.58
H
-0.58
A
-0.58
L
-0.58
S
-0.58
POSITIVE LOGITS
OGND
1.19
myſelf
1.17
reaſon
1.14
raiſ
1.11
purpoſe
1.09
poffible
1.09
neceffary
1.07
neceſſ
1.06
itſelf
1.06
pleaſure
1.06
Activations Density 0.057%