INDEX
Explanations
suggestions for improving or providing examples in documentation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1387
+0.13
0.5%
1974
+0.13
0.5%
871
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1974
+0.13
0.07
1387
+0.13
0.07
871
+0.12
0.06
Negative Logits
poulet
-0.82
chèvre
-0.76
frambo
-0.75
pommes
-0.74
patata
-0.73
bakso
-0.73
poire
-0.72
churras
-0.70
persil
-0.70
bandung
-0.69
POSITIVE LOGITS
can
0.85
can
0.84
contribue
0.76
Can
0.73
Can
0.72
CAN
0.72
affor
0.69
0.67
CAN
0.66
be
0.65
Activations Density 0.241%