INDEX
Neuron Alignment
Index
Value
% of L₁
276
+0.14
0.8%
49
+0.13
0.7%
100
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
49
+0.14
0.02
276
+0.13
0.02
128
+0.12
0.02
Negative Logits
ères
-1.75
opes
-1.62
aphyl
-1.56
érie
-1.55
ses
-1.54
EVER
-1.51
ère
-1.50
s
-1.47
ANCE
-1.46
ONS
-1.42
POSITIVE LOGITS
floor
1.84
borg
1.69
bank
1.65
level
1.61
dynamics
1.53
veteran
1.53
weed
1.50
walk
1.49
voltage
1.48
ño
1.47
Activations Density 0.018%