INDEX
Explanations
specific programming syntax and structure, particularly related to object initialization and method definitions in code
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
394
+0.13
0.7%
338
+0.11
0.6%
380
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
79
+0.13
0.03
131
+0.11
0.03
84
+0.11
0.03
Negative Logits
orer
-1.69
liness
-1.64
lessness
-1.61
)\].
-1.56
.).
-1.53
:**
-1.52
odend
-1.50
.);
-1.49
vier
-1.48
.](
-1.47
POSITIVE LOGITS
ij
4.13
Ĵ
4.13
ĵ
4.08
↵
4.00
↵↵
4.00
<|outofrange|>
4.00
č↵
4.00
↵↵↵
4.00
↵
4.00
↵ ↵
4.00
Activations Density 0.294%