Explanations
descriptions of shapes and structures
Neuron Alignment
Index   Value   % of L₁
1385    +0.20   0.6%
872     +0.11   0.3%
2016    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1705    +0.20      0.04
736     +0.11      0.04
678     +0.09      0.04
Negative Logits
Token    Logit
drap     -0.77
klu      -0.74
notor    -0.73
espé     -0.72
stoff    -0.72
incess   -0.71
sogget   -0.71
cabrio   -0.71
Sitten   -0.70
robus    -0.69
Positive Logits
Token     Logit
shaped    0.97
shape     0.89
shaped    0.85
shapes    0.73
shape     0.72
haped     0.64
Shaped    0.63
pattern   0.61
Shaped    0.58
Shape     0.58
Activations Density: 0.279%