INDEX
Explanations
structured data and code syntax
Neuron Alignment
Index   Value   % of L₁
189     +0.16   0.9%
214     +0.12   0.7%
406     +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
79      +0.16      0.10
167     +0.12      0.09
17      +0.11      0.02
Negative Logits
clock     -1.48
ally      -1.43
offices   -1.41
Fig       -1.38
fast      -1.38
bure      -1.36
coding    -1.36
awning    -1.27
amics     -1.26
alent     -1.25
Positive Logits
ĥ½        2.10
Ĺ         1.74
ĨĴ        1.56
į         1.52
ħ         1.48
İ         1.47
ij        1.46
ķ         1.45
retire    1.45
»         1.44
Activations Density: 2.374%