INDEX
Explanations
nested structures or elements in a document
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
410
+0.13
0.7%
478
+0.12
0.7%
51
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
175
+0.13
0.03
51
+0.12
0.03
341
+0.12
0.03
Negative Logits
illin
-1.66
ite
-1.60
around
-1.54
augh
-1.54
ually
-1.45
wise
-1.44
rew
-1.38
.[]{-1.35
inating
-1.32
istically
-1.32
POSITIVE LOGITS
IJ
3.68
¡
3.49
ij
3.38
ĨĴ
3.35
ĥ½
3.35
ķ
3.31
ļ
3.24
»
3.23
¥
3.22
ĸ
3.16
Activations Density 0.100%