INDEX
Neuron Alignment
Index
Value
% of L₁
14
+0.14
0.8%
123
+0.12
0.7%
478
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
14
+0.14
0.06
123
+0.12
0.06
434
+0.12
0.05
Negative Logits
iences
-1.71
avour
-1.60
ably
-1.55
(\<
-1.55
abl
-1.48
able
-1.47
)\].
-1.47
roles
-1.46
)\]
-1.44
quer
-1.42
POSITIVE LOGITS
↵
2.23
<|outofrange|>
2.23
↵↵↵
2.23
↵
2.23
2.23
<|outofrange|>
2.23
↵
2.23
↵
2.23
↵
2.23
2.23
Activations Density 0.285%