INDEX
Explanations
punctuation and special symbols
Text followed by colons or brackets
numbers and specific terms like "final"
Negative Logits
N     -0.58
S     -0.58
No    -0.58
a     -0.56
L     -0.56
“     -0.56
I     -0.56
E     -0.55
P     -0.55
Al    -0.55
Positive Logits
              1.13
Theſe         1.01
Monfieur      0.92
myſelf        0.88
themſelves    0.87
Shakspeare    0.87
quæ           0.86
itſelf        0.85
uſed          0.82
becauſe       0.80
Activations Density 2.142%