INDEX
Explanations
terms related to germs and their associated biological concepts
Neuron Alignment
Index  Value  % of L₁
74     +0.14  0.8%
431    +0.13  0.7%
77     +0.13  0.7%
Correlated Neurons
Index  P. Corr.  Cos Sim.
386    +0.14     0.02
74     +0.13     0.02
431    +0.13     0.02
Negative Logits
Ļª               -2.67
(blank)          -2.64
↵                -2.64
↵                -2.64
↵↵               -2.64
↵                -2.64
↵                -2.64
<|outofrange|>   -2.64
↵                -2.64
(blank)          -2.64
Positive Logits
osities   1.93
osity     1.88
ously     1.84
ulence    1.80
andom     1.79
arium     1.74
ophage    1.69
acy       1.53
unfolded  1.48
aceut     1.43
Activations Density 0.245%