INDEX
Explanations
terminology related to scientific theories and methods
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
872
+0.21
0.6%
1253
+0.12
0.4%
203
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
872
+0.21
0.08
203
+0.12
0.04
563
+0.09
0.03
Negative Logits
UnusedPrivate
-0.68
AddHtmlAttribute
-0.65
BeginInit
-0.64
EndInit
-0.63
***!
-0.61
smithy
-0.60
UDAD
-0.58
Javadoc
-0.57
anglès
-0.57
RegressionTest
-0.57
POSITIVE LOGITS
increa
1.36
impra
1.34
hentai
1.34
disreg
1.34
milf
1.32
maneu
1.32
snoopy
1.31
shenan
1.30
stickied
1.29
intersper
1.28
Activations Density 0.695%