INDEX
Explanations
descriptions of technical features in software development
Neuron Alignment
Index   Value   % of L₁
876     +0.20   0.6%
1403    +0.14   0.4%
1284    +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
876     +0.20      -0.00
678     +0.14      0.04
1284    +0.13      0.04
Negative Logits
Token       Logit
kosme       -0.87
Demokrat    -0.87
bakteri     -0.86
kalori      -0.82
radikal     -0.82
Perú        -0.81
silikon     -0.81
kriminal    -0.80
akade       -0.79
Educación   -0.79
Positive Logits
Token         Logit
scrat         2.02
eiffel        2.00
affor         1.93
increa        1.92
guarante      1.91
encomp        1.87
maneu         1.86
milf          1.85
lamborghini   1.81
squa          1.81
Activations Density: 0.263%