INDEX
Explanations
XML version declarations and elements related to XML schemas
Neuron Alignment
Index   Value   % of L₁
307     +0.14   0.8%
143     +0.11   0.6%
197     +0.10   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
307     +0.14      0.01
345     +0.11      0.01
21      +0.10      0.01
Negative Logits
.’”            -1.71
...](          -1.64
advantage      -1.64
.’             -1.54
?’             -1.52
planned        -1.51
’.             -1.46
researchers    -1.42
‘              -1.41
disadvantage   -1.33
Positive Logits
0000000000000000000000000000000000   1.87
aho                                  1.68
KB                                   1.60
ullivan                              1.51
rå                                   1.49
000000                               1.46
kb                                   1.45
åı·                                  1.44
riterion                             1.40
xffffffff                            1.37
Activations Density 0.055%