INDEX
Explanations
references to different species and their classifications in biological contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.17
1.0%
283
+0.14
0.8%
301
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
283
+0.17
0.02
321
+0.14
0.01
301
+0.12
0.01
Negative Logits
ŀ
-3.45
º
-3.44
ĥ½
-3.38
č↵
-3.23
č↵
-3.23
-3.23
↵ ↵
-3.23
↵↵
-3.23
<|outofrange|>
-3.23
↵
-3.23
POSITIVE LOGITS
finder
1.88
optera
1.88
richness
1.80
iverse
1.70
liest
1.69
lia
1.66
remains
1.65
ional
1.62
lion
1.57
varies
1.56
Activations Density 0.101%