INDEX
Neuron Alignment
Index    Value    % of L₁
1262     +0.09    0.3%
703      +0.08    0.3%
1035     +0.07    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1763     +0.09       0.02
241      +0.08       0.02
1331     +0.07       0.02
Negative Logits
public      -0.75
자          -0.71
continue    -0.70
VIRON       -0.69
ത്ത          -0.68
ുറ          -0.68
else        -0.68
下          -0.68
واح         -0.68
,           -0.67
Positive Logits
accla     2.18
increa    2.14
Phil      2.14
affor     2.09
emphat    2.06
maneu     2.05
inev      1.99
ftu       1.95
Phil      1.94
impra     1.92
Activations Density 0.070%