INDEX
Explanations
code elements related to variable assignments and comparisons
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.12
0.7%
115
+0.12
0.7%
369
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
497
+0.12
0.03
133
+0.12
0.05
345
+0.12
0.06
Negative Logits
ieri
-1.84
izer
-1.75
eries
-1.55
iously
-1.53
ously
-1.46
ientos
-1.44
absor
-1.44
Listener
-1.43
[(
-1.43
ector
-1.43
POSITIVE LOGITS
ĨĴ
2.60
ī
2.54
³
2.38
Ĺ
2.34
¬
2.32
¿
2.29
¿½
2.28
¦
2.28
¤
2.24
Īĺ
2.22
Activations Density 1.120%