INDEX
Explanations
occurrences of articles and prepositions
Neuron Alignment
Index   Value   % of L₁
233     +0.11   0.6%
410     +0.10   0.6%
82      +0.09   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
98      +0.11      0.16
321     +0.10      0.17
440     +0.09      0.18
Negative Logits
tons           -1.68
chen           -1.54
synthes        -1.44
forward        -1.41
analytical     -1.40
exped          -1.40
sequentially   -1.38
frequency      -1.38
fast           -1.37
elliptic       -1.33
Positive Logits
¦     2.79
Ń     2.65
±     2.44
Ĥ¬    2.34
»¿    2.34
¾     2.32
¢     2.32
¿½    2.25
¡     2.25
Ļ     2.23
Activations Density 1.722%