INDEX
Explanations
statistical and numerical information
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1445
+0.16
0.5%
690
+0.09
0.3%
184
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1445
+0.16
0.06
51
+0.09
0.02
832
+0.09
0.04
Negative Logits
marte
-1.02
robus
-0.93
anse
-0.93
spion
-0.91
fasc
-0.90
monaster
-0.90
monast
-0.89
juf
-0.88
paillettes
-0.86
strass
-0.86
POSITIVE LOGITS
estimates
0.56
average
0.56
averages
0.53
total
0.52
estimated
0.50
frecuente
0.50
rates
0.50
further
0.50
another
0.48
counting
0.48
Activations Density 0.391%