Explanations
instances of coding and programming-related topics
Neuron Alignment
Index   Value   % of L₁
369     +0.16   0.9%
152     +0.11   0.6%
95      +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
152     +0.16      0.02
366     +0.11      0.02
195     +0.11      0.01
Negative Logits
pts        -1.66
¥          -1.58
entry      -1.50
quarters   -1.45
ľ          -1.45
]{.        -1.44
lux        -1.43
yz         -1.42
entries    -1.38
hti        -1.35
Positive Logits
argu       1.90
agne       1.52
ando       1.47
yours      1.44
(@         1.44
olis       1.43
builder    1.42
foreach    1.41
akov       1.37
>>>        1.36
Activations Density 0.036%