INDEX
Explanations
code delimiters, special characters, and newlines
New Auto-Interp
Negative Logits
■
1.13
.
1.12
■
1.04
•
1.03
0.94
.•
0.92
••
0.91
0.86
0.85
•
0.84
POSITIVE LOGITS
`
3.48
`
2.54
`$
2.54
`.
2.38
(`
2.32
`<
2.30
`'
2.28
`#
2.27
`{2.23
`/
2.21
Activations Density 3.424%