INDEX
Explanations
frequent references to a specific configuration or settings within a programming context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
152
+0.15
0.9%
488
+0.13
0.7%
186
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
35
+0.15
0.05
495
+0.13
0.05
96
+0.13
0.05
Negative Logits
Inflater
-1.47
Universe
-1.32
Docket
-1.32
entary
-1.32
ème
-1.30
."
-1.27
cares
-1.25
'"
-1.25
'">
-1.24
Broadcasting
-1.24
POSITIVE LOGITS
Ĵ
1.79
him
1.77
abouts
1.64
ity
1.62
»
1.54
¨
1.53
avirus
1.50
ifiable
1.49
bler
1.46
sure
1.46
Activations Density 0.079%