INDEX
Explanations
punctuation marks and specific syntax elements
New Auto-Interp
Negative Logits
$<
-0.67
(>
-0.64
$>
-0.63
↓
-0.57
<=
-0.57
->
-0.55
$<
-0.55
-->
-0.55
*>
-0.54
↑
-0.54
POSITIVE LOGITS
>";
1.07
>";
1.03
>");
1.02
>");
0.98
>",
0.95
">{{0.94
>'
0.93
>",
0.91
>{0.91
">${0.91
Activations Density 0.565%