Explanations
references to the word "Les."
Neuron Alignment
Index   Value   % of L₁
303     +0.16   0.9%
1045    +0.13   0.7%
50      +0.12   0.6%
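
The "% of L₁" column is not defined on this page. One plausible reading, assumed here, is that it gives each neuron's absolute value as a share of the L₁ norm (the sum of absolute values) of the whole vector. The listed rows are roughly consistent with that reading, since 0.16 / 0.009, 0.13 / 0.007 and 0.12 / 0.006 all imply a total L₁ norm of about 18 to 20, but this is an inference, not something the page states. A minimal sketch under that assumption, using a toy vector and a hypothetical helper name `l1_share`:

```python
def l1_share(weights, index):
    """Return |weights[index]| as a fraction of the vector's L1 norm.

    Assumption: this mirrors what the "% of L1" column reports;
    the dashboard itself does not define the column.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return abs(weights[index]) / l1_norm


# Toy vector for illustration only, not the real feature weights.
toy_weights = [0.5, -0.25, 0.25]
print(f"{l1_share(toy_weights, 0):.1%}")  # 50.0%
```

With a vector that has many small entries, even the top-aligned neuron holds only a small share, which is what values below 1% would indicate under this reading.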
Correlated Neurons
Index   P. Corr.   Cos Sim.
303     +0.16      0.02
1335    +0.13      0.02
1178    +0.12      0.02
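
The table reports two similarity measures side by side. Assuming "P. Corr." stands for Pearson correlation and "Cos Sim." for cosine similarity, the two differ only in that Pearson correlation subtracts each vector's mean first. The page does not say which vectors are compared in each column, so the sketch below only illustrates the standard definitions on toy data and shows that the two numbers can disagree.

```python
import math


def cosine_similarity(x, y):
    """Cosine of the angle between x and y (both must be non-zero)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def pearson_correlation(x, y):
    """Pearson correlation: cosine similarity of the mean-centered vectors.

    Undefined (division by zero) if either vector is constant.
    """
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine_similarity(
        [a - mean_x for a in x],
        [b - mean_y for b in y],
    )


# Toy data for illustration only, not taken from the dashboard.
x = [1.0, 1.0, 2.0]
y = [2.0, 1.0, 1.0]
print(cosine_similarity(x, y), pearson_correlation(x, y))
```

On this toy pair the cosine similarity is positive while the Pearson correlation is negative, so a gap between the two columns is not surprising in itself.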
Negative Logits
Token     Logit
<bos>     -1.59
/**       -0.79
<?        -0.77
Quoi      -0.74
Aún       -0.72
(blank)   -0.69
Aucune    -0.69
Autre     -0.68
jątk      -0.67
Celui     -0.67
Positive Logits
Token     Logit
Les       1.29
LES       1.15
Les       1.13
les       1.06
hina      1.03
saar      1.02
les       0.96
Las       0.95
magis     0.94
sii       0.91
Activations Density: 0.080%