INDEX
Neuron Alignment
Index    Value    % of L₁
376      +0.16    0.9%
23       +0.14    0.8%
494      +0.14    0.8%
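The "% of L₁" column is not defined here. One plausible reading, stated as an assumption, is that it gives the magnitude of a single entry divided by the L₁ norm (the sum of absolute values) of the whole vector. The listed values are at least consistent with that reading: 0.16 at 0.9% and 0.14 at 0.8% both imply an L₁ norm of roughly 17 to 18. The sketch below illustrates the calculation on a toy vector; the function name `l1_share` and the toy weights are hypothetical and do not come from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a fraction of the vector's L1 norm.

    Assumption: this mirrors what a "% of L1" column might report;
    the dashboard itself does not define the column.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero, so the share is undefined")
    return abs(weights[index]) / l1_norm


# Toy vector (hypothetical values, not taken from the dashboard).
toy_weights = [0.5, -0.25, 0.25]
shares = [l1_share(toy_weights, i) for i in range(len(toy_weights))]

for i, share in enumerate(shares):
    # Format as a percentage with one decimal, like the table above.
    print(f"index {i}: {share:.1%}")
```

Small shares such as those in the table would indicate that no single neuron carries much of the vector's total weight under this reading.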
Correlated Neurons
Index    Pearson Corr.    Cosine Sim.
359      +0.16            0.03
496      +0.14            0.02
443      +0.14            0.02
Negative Logits
Token      Logit
ipore      -1.88
ented      -1.84
atories    -1.68
Pradesh    -1.67
erated     -1.63
idopsis    -1.58
arios      -1.57
enium      -1.57
olved      -1.56
ubot       -1.54
Positive Logits
Token    Logit
¬        2.79
ı        2.70
¤        2.69
IJ        2.57
Ģ        2.57
¥        2.54
¸        2.54
¯        2.53
Ļ        2.45
¾        2.44
Activations Density 0.148%
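
Activation density is commonly taken to mean the fraction of tokens on which a feature fires. The dashboard does not say how its figure was computed, so the following is only a minimal sketch under that assumption. The function name `activation_density`, the `threshold` parameter, and the toy activations are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of tokens where the feature
    is active; the dashboard does not state its exact definition.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy per-token activations (hypothetical, not taken from the dashboard).
toy_activations = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.2, 0.0]
density = activation_density(toy_activations)

print(f"Activations Density {density:.3%}")
```

Under this definition, a density of 0.148% would correspond to the feature being active on roughly 1 token in 676.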