INDEX
Neuron Alignment
Index
Value
% of L₁
376
+0.15
0.8%
274
+0.13
0.7%
480
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
480
+0.15
0.02
274
+0.13
0.02
372
+0.11
0.02
Negative Logits
¹
-2.88
ĵ
-2.71
ĨĴ
-2.68
IJ
-2.53
©
-2.52
Ĵ
-2.51
↵
-2.51
↵
-2.51
-2.51
č↵č↵
-2.51
POSITIVE LOGITS
istics
1.84
ulated
1.80
capacity
1.73
exercises
1.71
emann
1.66
records
1.59
room
1.58
equations
1.52
enses
1.52
occasions
1.52
Activations Density 0.011%