INDEX
Explanations
programming-related terms and structures
Neuron Alignment
Index    Value    % of L₁
26       +0.14    0.8%
37       +0.13    0.7%
193      +0.12    0.6%
Correlated Neurons
Index    P. Corr.    Cos Sim.
53       +0.14       0.04
17       +0.13       0.01
193      +0.12       0.04
Negative Logits
exhibits        -1.75
zel             -1.48
surrounds       -1.45
dez             -1.44
mos             -1.43
estimate        -1.42
senses          -1.41
exhibit         -1.39
igu             -1.37
hydrodynamic    -1.36
Positive Logits
cmd          1.90
defeating    1.61
uit          1.55
microsoft    1.55
begin        1.54
Equal        1.52
ctrine       1.52
hline        1.51
False        1.48
ollywood     1.48
Activations Density: 0.377%