INDEX
Explanations
programming syntax and structure elements
New Auto-Interp
Negative Logits
rungsseite
-0.93
++
-0.88
`,
-0.85
])):
-0.85
%"),
-0.83
^(@)
-0.82
')):
-0.81
]$}
-0.81
*/;
-0.81
%";
-0.80
POSITIVE LOGITS
↵↵
1.72
↵↵↵
0.95
↵↵↵↵
0.92
<eos>
0.70
↵↵↵↵↵
0.69
↵↵↵↵↵↵
0.68
...
0.65
↵↵↵↵↵↵↵
0.61
0.57
:)
0.57
Activations Density 0.195%