INDEX
Explanations
the presence of editorial content and references to editors
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
369
+0.13
0.8%
410
+0.13
0.8%
187
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
410
+0.13
0.03
253
+0.13
0.02
77
+0.12
0.01
Negative Logits
proper
-1.79
populated
-1.54
else
-1.49
ago
-1.48
hereafter
-1.45
ClickListener
-1.44
true
-1.42
longer
-1.42
stic
-1.39
]{}]{}-1.36
POSITIVE LOGITS
º
3.13
«
2.84
¾
2.83
Ń
2.82
§
2.77
ļ
2.71
ĻĤ
2.67
Ĵ
2.66
¦
2.64
ł
2.63
Activations Density 0.143%