INDEX
Explanations
code snippets or structural elements related to programming languages
New Auto-Interp
Negative Logits
'
-0.65
";
-0.62
%
-0.61
';
-0.60
?>
-0.60
L
-0.60
<
-0.59
=[]
-0.59
T
-0.59
/
-0.58
POSITIVE LOGITS
//
1.38
(
1.24
"
1.13
{1.04
$
1.04
[
1.01
#
0.91
“
0.81
\
0.79
<
0.77
Activations Density 0.318%