Explanations
elements related to hyperlinks and data structures in code
Negative Logits
Token       Logit
←           -0.57
←           -0.56
<--         -0.50
</h2>       -0.49
EnableWeb   -0.47
"           -0.43
'           -0.42
",          -0.42
";          -0.41
';          -0.41
Positive Logits
Token   Logit
>       1.86
>,      1.43
>*/     1.38
>;      1.37
>.      1.37
>\      1.37
>$      1.37
>       1.36
>"      1.34
>`      1.31
Activations Density: 1.130%