INDEX
Explanations
HTML/XML closing tags in text
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
201
+0.13
0.7%
209
+0.12
0.7%
77
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
201
+0.13
0.04
209
+0.12
0.04
39
+0.11
0.03
Negative Logits
.[]{-1.90
.’”
-1.66
.**]{}-1.62
.](
-1.61
[]{-1.57
enen
-1.53
**]{},-1.47
reduc
-1.47
](
-1.46
.",
-1.46
POSITIVE LOGITS
ĻĤ
3.44
Ľ
3.43
·
3.39
Īĺ
3.35
¿
3.22
Ĥ¬
3.22
<|outofrange|>
3.19
↵Č
3.19
<|outofrange|>
3.19
3.19
Activations Density 0.075%