INDEX
Explanations
references to singleton instances in programming or data structures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.25
1.5%
376
+0.17
1.0%
382
+0.11
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
292
+0.25
0.39
310
+0.17
0.31
169
+0.11
0.28
Negative Logits
ub
-1.40
xy
-1.36
uke
-1.35
____________
-1.33
ellow
-1.31
Eve
-1.30
uff
-1.30
avier
-1.29
orporated
-1.29
onas
-1.29
POSITIVE LOGITS
types
1.83
hood
1.80
erals
1.79
pile
1.68
life
1.63
"}](#
1.63
ranges
1.60
plicity
1.60
mythology
1.57
stown
1.54
Activations Density 0.919%