INDEX
Neuron Alignment
Index
Value
% of L₁
162
+0.13
0.7%
119
+0.12
0.7%
369
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
162
+0.13
0.03
235
+0.12
0.03
119
+0.12
0.02
Negative Logits
Ī
-2.87
¨
-2.81
Ĥ¬
-2.80
¯
-2.78
Īĺ
-2.77
ĥ
-2.72
ij
-2.69
Ĥ
-2.67
ĭ
-2.64
Ĭ
-2.63
POSITIVE LOGITS
://
2.85
zip
1.78
quote
1.66
typeof
1.63
typeof
1.60
wal
1.59
svg
1.56
press
1.55
doi
1.52
cord
1.50
Activations Density 0.052%