INDEX
Explanations
references to threading and task management in programming
Neuron Alignment
Index   Value   % of L₁
156     +0.21   1.2%
493     +0.14   0.8%
230     +0.13   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
493     +0.21      0.02
80      +0.14      0.02
17      +0.13      0.00
Negative Logits
Token             Logit
ł                 -4.02
↵ ↵               -3.71
↵                 -3.71
↵ ↵               -3.71
(blank)           -3.71
(blank)           -3.71
(blank)           -3.71
↵                 -3.71
(blank)           -3.71
<|outofrange|>    -3.71
Positive Logits
Token     Logit
bare      2.25
ings      1.82
pool      1.64
nai       1.60
aloud     1.57
behind    1.51
behind    1.47
ing       1.47
uelle     1.46
ago       1.46
Activations Density 0.116%