INDEX
Explanations
laws, regulations, and official procedures described either internally or externally
Neuron Alignment
Index   Value   % of L₁
297     +0.11   0.3%
1870    +0.11   0.3%
690     +0.10   0.3%
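
The "% of L₁" column is not defined on this page. One plausible reading, assumed here and not confirmed by the source, is that it reports the absolute value of one entry as a share of the L₁ norm (the sum of absolute values) of the whole vector. A minimal sketch under that assumption, where `l1_share` and the toy vector are hypothetical names and data used only for illustration:

```python
def l1_share(weights, index):
    """Return |weights[index]| as a percentage of the vector's L1 norm.

    Assumption: "% of L1" means this share; the dashboard does not say so.
    """
    l1 = sum(abs(w) for w in weights)
    if l1 == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return 100.0 * abs(weights[index]) / l1


# Hypothetical toy vector, not the feature's real weights.
toy = [0.5, -0.25, 0.25]
print(l1_share(toy, 0))  # 50.0
```

Under this reading, a small percentage for even the top-ranked neurons would mean the vector's mass is spread across many neurons.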
Correlated Neurons
Index   P. Corr.   Cos Sim.
1705    +0.11      0.09
2006    +0.11      0.08
297     +0.10      0.07
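
The page does not state which vectors are being compared. As a sketch only, assuming "P. Corr." is the Pearson correlation and "Cos Sim." is the cosine similarity between two equal-length vectors, the two measures can be computed as follows. The function names are hypothetical, and the example inputs are toy data, not values from this feature:

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Raises ZeroDivisionError if either vector is all zeros.
    return dot / (norm_a * norm_b)


def pearson(a, b):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    # Raises ZeroDivisionError if either vector is constant.
    return cosine_similarity(centered_a, centered_b)


# Toy data: both vectors share a large offset, so cosine similarity is
# close to 1 even though the values move in opposite directions.
u = [10.0, 11.0, 12.0]
v = [12.0, 11.0, 10.0]
print(cosine_similarity(u, v), pearson(u, v))
```

The two measures differ only in the mean-centering step, which is why they can disagree when the vectors carry a shared offset.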
Negative Logits
alkoh     -1.11
meis      -1.09
sergio    -1.09
kosme     -1.09
utop      -1.08
solidar   -1.07
socie     -1.06
alberto   -1.06
sappi     -1.05
spion     -1.05
Positive Logits
for              0.62
QUADS            0.60
are              0.60
throughout       0.59
pacs             0.58
which            0.58
with             0.58
hips             0.57
galore           0.57
contentPadding   0.57
Activations Density 0.703%