INDEX
Explanations
function calls related to manipulating a stack or collection in programming
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
376
+0.19
1.1%
59
+0.15
0.9%
369
+0.14
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
59
+0.19
0.01
451
+0.15
0.01
454
+0.14
0.01
Negative Logits
NHS
-1.69
vivo
-1.46
slightest
-1.43
apparent
-1.42
cross
-1.40
adopt
-1.32
known
-1.31
contents
-1.28
observable
-1.28
observed
-1.27
POSITIVE LOGITS
manship
2.02
holder
1.99
oslav
1.85
starter
1.79
legend
1.76
ños
1.71
mania
1.68
ankind
1.67
ño
1.66
ī
1.64
Activations Density 0.007%