INDEX
Explanations
instances of structured data formats or programming constructs
Code, parentheses, brackets, and other symbols
code delimiters
New Auto-Interp
Negative Logits
RenderAtEndOf
-1.14
IntoConstraints
-1.12
<unused79>
-1.02
<unused41>
-1.02
<unused52>
-1.02
<unused14>
-1.02
<unused16>
-1.02
<unused8>
-1.02
[@BOS@]
-1.02
<unused3>
-1.02
POSITIVE LOGITS
↵↵
0.78
<eos>
0.71
↵
0.56
.
0.54
↵↵↵
0.54
2
0.51
1
0.47
The
0.43
3
0.43
<em>
0.41
Activations Density 0.730%