INDEX
Explanations
technical terms related to software components or coding
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
893
+0.13
0.5%
395
+0.12
0.5%
122
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
395
+0.13
0.02
893
+0.12
0.02
325
+0.12
0.02
Negative Logits
Darío
-0.90
alberto
-0.82
philanth
-0.80
accla
-0.79
indestru
-0.77
jorge
-0.77
sergio
-0.75
Mónica
-0.74
disgra
-0.72
unspeak
-0.72
POSITIVE LOGITS
component
1.40
Component
1.33
components
1.31
component
1.28
Components
1.24
Component
1.24
components
1.17
COMPONENT
1.07
Components
1.05
componente
1.00
Activations Density 0.053%