INDEX
Neuron Alignment
Index
Value
% of L₁
221
+0.12
0.7%
310
+0.11
0.6%
72
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
188
+0.12
0.24
486
+0.11
0.16
154
+0.10
0.13
Negative Logits
es
-1.56
eker
-1.55
ahl
-1.54
mighty
-1.53
ema
-1.49
eking
-1.49
oth
-1.49
iere
-1.46
heer
-1.38
ätt
-1.36
POSITIVE LOGITS
Posts
1.50
assadors
1.34
Katie
1.26
Squadron
1.24
rons
1.23
Serge
1.22
ó
1.22
Choice
1.21
Posts
1.21
deduction
1.21
Activations Density 0.436%