INDEX
Explanations
patterns related to mathematical or logical expressions, particularly those involving parentheses and brackets
Negative Logits
(    -1.46
(    -1.27
[    -1.25
_    -1.10
1    -1.10
[    -1.07
-    -1.05
2    -1.04
'    -1.04
n    -1.01
Positive Logits
"])      4.06
")));    4.06
']))     3.95
)");     3.83
"]);     3.81
})$}     3.81
')")     3.79
.)}      3.78
"]));    3.78
'))      3.69
Activations Density: 0.535%