INDEX
Neuron Alignment
Index    Value    % of L₁
2019     +0.24    0.8%
678      +0.16    0.6%
50       +0.16    0.6%
Correlated Neurons
Index    P. Corr.    Cos Sim.
545      +0.24       0.06
1404     +0.16       0.06
678      +0.16       0.06
Negative Logits
so            -0.68
in            -0.66
as            -0.65
tupperware    -0.64
im            -0.62
占用          -0.62
he            -0.61
sign          -0.61
let           -0.60
her           -0.60
Positive Logits
alkoh       1.24
silikon     1.23
antik       1.23
kafe        1.21
kosme       1.19
praktik     1.18
optik       1.15
mikrofon    1.14
keramik     1.14
panik       1.14
Activations Density 0.247%