INDEX
Explanations
structured data representations and attributes associated with items like sizes and references
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
121
+0.15
0.9%
143
+0.12
0.7%
499
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
249
+0.15
0.06
121
+0.12
0.08
155
+0.10
0.15
Negative Logits
block
-1.49
etc
-1.45
">
-1.34
processor
-1.33
storing
-1.33
sembles
-1.27
baum
-1.27
ew
-1.26
stored
-1.25
thumb
-1.22
POSITIVE LOGITS
¿½
2.16
ĥ½
2.12
ł
1.91
Į
1.89
Ī
1.86
ĩ
1.85
Ł
1.83
Ŀ
1.75
į
1.70
ī
1.68
Activations Density 5.611%