Explanations
technical terms and equations related to physics and mathematics
Neuron Alignment
Index   Value   % of L₁
419     +0.14   0.8%
448     +0.11   0.6%
12      +0.11   0.6%
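The page does not say how the "% of L₁" column is computed. One plausible reading, offered here only as an assumption, is that each neuron's value is divided by the L1 norm (sum of absolute values) of the full weight vector for this feature. The sketch below illustrates that reading on a made-up vector; the function name `l1_shares` and the toy data are hypothetical and are not taken from the dashboard.

```python
def l1_shares(weights, top_k=3):
    """Return (index, value, share) for the top_k largest-magnitude entries.

    ASSUMPTION: 'share' is |value| divided by the L1 norm of the whole
    vector. The source page does not confirm this definition.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        return []
    # Rank entries by absolute value, largest first (ties keep index order).
    ranked = sorted(enumerate(weights), key=lambda iw: abs(iw[1]), reverse=True)
    return [(i, w, abs(w) / l1_norm) for i, w in ranked[:top_k]]


# Hypothetical toy vector, not data from the dashboard.
toy = [0.5, -0.25, 0.0, 0.25]
rows = l1_shares(toy, top_k=2)
for index, value, share in rows:
    print(f"{index:>5}  {value:+.2f}  {share:.1%}")
```

Under this reading, a value of +0.14 at 0.8% would imply an L1 norm of roughly 17 to 18 for the full vector, which is at least consistent with the other two rows after rounding.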
Correlated Neurons
Index   P. Corr.   Cos Sim.
17      +0.14      0.00
320     +0.11      0.03
136     +0.11      -0.01
Negative Logits
Token           Value
apple           -1.75
nothing         -1.52
"></            -1.49
want            -1.46
documentation   -1.46
pops            -1.45
)...            -1.44
...)            -1.41
asek            -1.40
Answer          -1.40
Positive Logits
Token   Value
»       2.56
¾       2.52
Īĺ      2.52
Ĩ       2.41
¦       2.37
®       2.37
ĨĴ      2.36
Ŀ       2.32
°       2.30
«       2.21
Activations Density: 0.304%