INDEX
Explanations
code references related to programming constructs and actions
New Auto-Interp
Negative Logits
#
-0.81
#
-0.75
int
-0.63
int
-0.63
\#
-0.57
\#
-0.49
:#
-0.49
.#
-0.48
+#
-0.47
float
-0.46
POSITIVE LOGITS
console
0.95
var
0.91
var
0.86
!';
0.85
)$;
0.85
console
0.84
})();
0.82
};
0.80
!');
0.80
'};
0.79
Activations Density 0.142%