INDEX
Explanations
steps or actions related to software development and testing
Neuron Alignment
Index   Value   % of L₁
872     +0.13   0.4%
876     +0.09   0.3%
1150    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1134    +0.13      0.07
1843    +0.09      0.06
1377    +0.09      0.04
Negative Logits
Horizonte     -0.69
sartén        -0.59
ulipas        -0.58
Unito         -0.57
Bajos         -0.57
Zeneca        -0.56
terrorismo    -0.55
Baillargeon   -0.54
demment       -0.54
WQS           -0.53
Positive Logits
impra        1.02
maneu        0.96
encomp       0.96
intersper    0.95
disreg       0.93
uninten      0.91
shenan       0.88
unlaw        0.86
resear       0.86
reluct       0.85
Activations Density: 2.064%