INDEX
Explanations
references to advantages or benefits
Neuron Alignment
Index   Value   % of L₁
376     +0.14   0.8%
206     +0.12   0.7%
241     +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
206     +0.14      0.02
453     +0.12      0.02
406     +0.11      0.02
Negative Logits
sel      -1.80
affe     -1.61
ORE      -1.50
ĨĴ       -1.45
ness     -1.45
ANI      -1.42
seys     -1.42
DU       -1.41
nbsp     -1.41
Edited   -1.40
Positive Logits
radius         1.84
ously          1.80
territory      1.65
trees          1.58
radii          1.47
area           1.47
compartments   1.47
ways           1.46
areas          1.46
crops          1.44
Activations Density 0.029%