INDEX
Explanations
technical specifications and parameters related to computing or digital systems
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
59
+0.15
0.9%
192
+0.14
0.8%
241
+0.13
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
136
+0.15
-0.00
59
+0.14
0.12
192
+0.13
0.10
Negative Logits
ochond
-1.54
este
-1.49
giving
-1.43
hed
-1.42
pler
-1.42
amer
-1.31
áĢº
-1.30
ér
-1.30
Office
-1.30
etting
-1.28
POSITIVE LOGITS
Ļª
4.93
ł
4.88
↵ ↵
4.72
<|outofrange|>
4.72
↵
4.72
4.72
<|outofrange|>
4.72
↵↵
4.72
↵↵↵
4.72
čč
4.72
Activations Density 3.098%