INDEX
Neuron Alignment
Index
Value
% of L₁
1343
+0.17
0.5%
1741
+0.15
0.5%
1510
+0.14
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1510
+0.17
0.04
1343
+0.15
0.05
588
+0.14
0.05
Negative Logits
<bos>
-1.00
-0.67
/**
-0.56
desertcart
-0.54
seem
-0.54
州市
-0.54
在一
-0.54
러한
-0.52
ляє
-0.52
carried
-0.51
POSITIVE LOGITS
franz
1.36
bloss
1.35
blos
1.32
ordina
1.31
dora
1.31
haup
1.30
bordeaux
1.30
nutr
1.27
ciga
1.26
meis
1.25
Activations Density 0.313%