INDEX
Explanations
blockquotes and quoted text
words with specific formatting or styles
HTML and code snippets
New Auto-Interp
Negative Logits
-1.03
d
-1.01
R
-0.94
Le
-0.94
B
-0.93
t
-0.92
di
-0.91
b
-0.91
to
-0.90
s
-0.90
POSITIVE LOGITS
itſelf
1.84
myſelf
1.52
Houſe
1.52
Shakspeare
1.50
Jefus
1.46
Monfieur
1.46
Anſ
1.45
houſe
1.43
pleaſure
1.41
ſelf
1.40
Activations Density 2.986%