INDEX
Explanations
document boundaries and structural markers that indicate new sections in mixed code and formal text content
various types of punctuation marks and symbols
New Auto-Interp
Negative Logits
↵
-1.39
↵↵
-1.19
_
-1.13
(
-1.02
-1.01
-0.99
{-0.94
//
-0.91
{-0.88
-0.85
POSITIVE LOGITS
^(@)
1.30
ſelves
1.25
itſelf
1.23
<bos>
1.22
pleaſure
1.20
BibitemShut
1.18
iſt
1.17
myſelf
1.17
purpoſe
1.16
reaſon
1.15
Activations Density 1.267%