INDEX
Explanations
proper nouns and terms related to names or labeling
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
32
+0.14
0.5%
553
+0.12
0.5%
67
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
553
+0.14
0.04
32
+0.12
0.04
1565
+0.12
0.04
Negative Logits
liberi
-0.66
trist
-0.65
vinci
-0.64
loue
-0.63
fatis
-0.63
pép
-0.60
Spirito
-0.60
ché
-0.60
serre
-0.60
majest
-0.60
POSITIVE LOGITS
name
1.19
name
1.10
names
1.10
Name
1.08
names
1.06
NAME
1.02
Names
1.01
NAME
0.98
Name
0.98
getName
0.94
Activations Density 0.099%