INDEX
Explanations
references to nutrients, vitamins, and their health impacts
Neuron Alignment
Index   Value   % of L₁
156     +0.22   1.3%
184     +0.13   0.8%
176     +0.11   0.7%
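
The "% of L₁" column is not defined on this page. One plausible reading, stated here only as an assumption, is that it reports each neuron's absolute value as a share of the L₁ norm of the full alignment vector. The sketch below checks whether the three rows are consistent with a single norm under that reading. The formula, the helper name `implied_l1_range`, and the rounding steps are hypothetical and do not come from the source.

```python
# Rows copied from the Neuron Alignment table: (neuron index, value, % of L1).
rows = [(156, 0.22, 1.3), (184, 0.13, 0.8), (176, 0.11, 0.7)]


def implied_l1_range(value, pct, value_step=0.01, pct_step=0.1):
    """Range of L1 norms consistent with one rounded table row.

    Assumption (not from the source): pct = 100 * |value| / L1, and both
    displayed numbers were rounded to the nearest display step.
    """
    v_lo = abs(value) - value_step / 2
    v_hi = abs(value) + value_step / 2
    p_lo = (pct - pct_step / 2) / 100
    p_hi = (pct + pct_step / 2) / 100
    # Smallest norm: smallest value over largest share, and vice versa.
    return v_lo / p_hi, v_hi / p_lo


ranges = [implied_l1_range(v, p) for _, v, p in rows]

# A single norm fits every row only if the per-row ranges overlap.
l1_lo = max(lo for lo, _ in ranges)
l1_hi = min(hi for _, hi in ranges)
consistent = l1_lo <= l1_hi

print(f"implied L1 norm in [{l1_lo:.2f}, {l1_hi:.2f}]; consistent={consistent}")
```

If the ranges overlap, the rows are at least compatible with the assumed definition. That does not confirm it is the definition the dashboard uses.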
Correlated Neurons
Index   P. Corr.   Cos Sim.
186     +0.22      0.03
32      +0.13      0.03
321     +0.11      0.03
Negative Logits
Token             Logit
ĥ½                -4.09
↵↵                -4.00
↵ âĢĥ             -4.00
↵                 -4.00
<|outofrange|>    -4.00
↵                 -4.00
↵                 -4.00
<|outofrange|>    -4.00
                  -4.00
↵ ↵               -4.00
Positive Logits
Token     Logit
mith      2.31
ystems    2.17
pace      2.15
hell      2.13
elves     1.93
PEC       1.93
creen     1.92
ource     1.87
ystem     1.87
apon      1.85
Activations Density 0.310%