INDEX
Explanations
information related to data encryption and mathematical concepts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
876
+0.16
0.5%
1499
+0.13
0.4%
394
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1499
+0.16
0.08
919
+0.13
0.03
615
+0.12
0.04
Negative Logits
Salón
-0.61
Solidar
-0.61
kompati
-0.57
Darío
-0.53
Belén
-0.51
Weit
-0.51
Jó
-0.51
تضيفلها
-0.50
bodas
-0.50
Renée
-0.49
POSITIVE LOGITS
perfon
0.97
xdrive
0.91
reft
0.83
tranf
0.83
ftu
0.82
pym
0.81
deere
0.81
increa
0.80
ftre
0.80
suscep
0.79
Activations Density 0.467%