INDEX
Explanations
keywords related to the concept of quantity or numerical values
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.53
3.8%
111
+0.41
3.0%
419
+0.09
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
111
+0.53
0.20
156
+0.41
0.18
71
+0.09
0.17
Negative Logits
¿½
-2.51
↵
-2.11
↵
-2.11
↵↵
-2.11
↵
-2.11
-2.11
↵↵
-2.11
<|outofrange|>
-2.11
↵
-2.11
↵↵
-2.11
POSITIVE LOGITS
anks
1.61
agra
1.35
uclear
1.34
ovascular
1.26
ginx
1.26
ucle
1.21
ank
1.20
roscopic
1.20
atics
1.18
branches
1.18
Activations Density 0.147%