INDEX
Explanations
mathematical symbols and notation related to algorithms or mathematical expressions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
369
+0.18
1.0%
320
+0.13
0.7%
382
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
136
+0.18
0.00
436
+0.13
0.03
369
+0.13
0.01
Negative Logits
)\].
-1.87
)</
-1.80
)\]
-1.74
):
-1.59
}</
-1.59
));
-1.58
none
-1.54
)];
-1.54
");
-1.53
)):
-1.52
POSITIVE LOGITS
dern
1.64
,\,
1.50
heels
1.49
ammat
1.46
due
1.44
inflation
1.42
»
1.41
iff
1.40
iale
1.39
amsbsy
1.38
Activations Density 0.156%