INDEX
Explanations
components of code or programming syntax
Neuron Alignment
Index    Value    % of L₁
261      +0.19    1.1%
92       +0.12    0.7%
199      +0.12    0.7%
Correlated Neurons
Index    P. Corr.    Cos Sim.
261      +0.19       0.06
199      +0.12       0.06
399      +0.12       0.05
Negative Logits
ially        -1.69
vez          -1.54
astically    -1.49
ÅĪ           -1.48
iful         -1.34
ancer        -1.29
tract        -1.29
porter       -1.29
placed       -1.28
yne          -1.28
Positive Logits
Ŀ     2.42
ĵ     2.30
ģ     2.29
ħ     2.29
ľ     2.27
ī     2.26
ij    2.25
ĻĤ    2.24
ı     2.21
Ĥ¬    2.20
Activations Density 3.877%