Explanations
references to statistical percentages and figures
Neuron Alignment
Index    Value    % of L₁
206      +0.11    0.6%
479      +0.11    0.6%
59       +0.11    0.6%
Correlated Neurons
Index    P. Corr.    Cos Sim.
479      +0.11       0.04
511      +0.11       0.04
206      +0.11       0.04
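For readers unfamiliar with the P. Corr. and Cos Sim. columns, the sketch below shows how a Pearson correlation and a cosine similarity between two vectors are commonly computed. It is only an illustration: the function names and the sample vectors are hypothetical, and the page does not state which vectors (activations or weights) the dashboard actually compares.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences of floats."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Center both sequences on their means.
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    num = sum(a * b for a, b in zip(dx, dy))
    den = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    return num / den


def cosine(x, y):
    """Cosine similarity of two equal-length sequences (no mean-centering)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


# Hypothetical data, not taken from the dashboard.
feature_acts = [0.0, 1.0, 0.0, 2.0]
neuron_acts = [0.5, 1.5, 0.5, 2.5]

print(pearson(feature_acts, neuron_acts))
print(cosine(feature_acts, neuron_acts))
```

The only difference between the two measures is the mean-centering step, which is why they can disagree noticeably for the same pair of vectors.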
Negative Logits
Inflater    -1.75
APTER       -1.56
WM          -1.55
sts         -1.53
ESULT       -1.51
untime      -1.47
ETHOD       -1.46
METHODS     -1.45
SERVICES    -1.41
mt          -1.40
Positive Logits
ois         1.72
enty        1.71
talks       1.64
imet        1.58
qué         1.47
thereof     1.39
quarters    1.38
ière        1.37
cuts        1.37
rose        1.37
Activations Density: 3.169%