INDEX
Explanations
declarations in programming code
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
376
+0.16
0.9%
457
+0.11
0.6%
485
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
178
+0.16
0.01
48
+0.11
0.01
320
+0.10
0.01
Negative Logits
Ľ
-2.24
IJ
-2.20
ĭ
-2.09
ı
-2.05
³
-2.05
ī
-2.04
¥
-2.00
²
-1.96
ij
-1.93
Īĺ
-1.92
POSITIVE LOGITS
caption
1.61
factory
1.46
actors
1.43
amycin
1.42
structures
1.41
suppressor
1.39
puted
1.37
arial
1.37
ventional
1.36
pile
1.33
Activations Density 0.651%