INDEX
Neuron Alignment
Index
Value
% of L₁
50
+0.13
0.7%
757
+0.04
0.2%
1678
+0.04
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1324
+0.13
0.06
1443
+0.04
0.05
35
+0.04
0.05
Negative Logits
<bos>
-1.73
public
-0.78
ുറ
-0.74
addComponent
-0.69
/*
-0.68
also
-0.67
|}
-0.67
/**
-0.67
Kontrola
-0.67
build
-0.66
POSITIVE LOGITS
maneu
2.15
affor
2.14
accla
1.97
ftu
1.88
stockholm
1.88
fta
1.88
lidl
1.87
squa
1.85
disagre
1.84
strick
1.82
Activations Density 0.054%