Explanations
mentions of obligations or requirements
Neuron Alignment
Index   Value   % of L₁
1331    +0.15   0.6%
680     +0.12   0.5%
411     +0.12   0.4%
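The "% of L₁" column is not defined on this page. A minimal sketch of one plausible reading, assuming it is the absolute value of a single component divided by the L₁ norm of the whole vector, is shown below. The function name `l1_share` and the toy vector are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a percentage of the vector's L1 norm.

    Assumption: "% of L1" means one component's share of the sum of
    absolute values across all components.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return 100.0 * abs(weights[index]) / l1_norm


# Hypothetical 4-component vector (illustrative only, not dashboard data).
toy_weights = [0.5, -0.25, 0.25, 0.0]
toy_share = l1_share(toy_weights, 0)
```

Under this reading, a value of +0.15 at 0.6% would imply an L₁ norm of roughly 25 for the full vector, which is in line with the other two rows given rounding.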
Correlated Neurons
Index   P. Corr.   Cos Sim.
1331    +0.15      0.04
239     +0.12      0.04
411     +0.12      0.04
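The two columns of the Correlated Neurons table are not defined on this page. As a hedged sketch, assuming "P. Corr." is a Pearson correlation between two activation series and "Cos Sim." is the cosine similarity between two vectors, the quantities could be computed as follows. The function names and inputs are illustrative, not the dashboard's own code.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)
```

The two measures differ in that Pearson correlation subtracts the mean of each series first, while cosine similarity does not, so the columns need not agree in magnitude.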
Negative Logits
Meksi     -1.10
silikon   -0.92
karton    -0.92
seksi     -0.92
optik     -0.89
saar      -0.85
keramik   -0.85
siena     -0.83
Abbé      -0.82
maksi     -0.82
Positive Logits
must       1.06
must       1.02
Must       0.93
Must       0.88
MUST       0.88
MUST       0.77
doit       0.67
mustache   0.61
musí       0.60
mustard    0.59
Activations Density: 0.082%