INDEX
Explanations
methods and descriptions in procedural or instructional contexts
Neuron Alignment
Index    Value    % of L₁
263      +0.18    1.0%
198      +0.14    0.8%
478      +0.13    0.7%
Correlated Neurons
Index    P. Corr.    Cos Sim.
354      +0.18       0.09
278      +0.14       0.06
130      +0.13       0.11
Negative Logits
»¿             -2.43
į              -2.20
¿½             -2.06
ľ              -1.97
«              -1.90
¡              -1.86
possessions    -1.81
¾              -1.76
·              -1.74
ĭ              -1.74
Positive Logits
apest           1.95
method          1.75
methods         1.64
method          1.59
iterative       1.59
optimization    1.57
asone           1.55
inductive       1.51
techniques      1.46
designed        1.43
Activations Density 2.114%