INDEX
Explanations
references to array and tensor data structures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
391
+0.16
0.9%
41
+0.10
0.6%
392
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
391
+0.16
0.02
49
+0.10
0.01
226
+0.10
0.01
Negative Logits
^âĪĴ
-1.56
^--
-1.53
MSO
-1.51
Citizens
-1.50
?"
-1.47
>/
-1.44
>=
-1.44
...?"
-1.43
Workers
-1.41
>"
-1.39
POSITIVE LOGITS
borne
1.85
ÅĽci
1.83
eries
1.82
bourg
1.81
etable
1.74
wordpress
1.67
leen
1.63
bag
1.61
enstein
1.60
nai
1.59
Activations Density 0.014%