INDEX
Explanations
discussions around the complexity of identity and relationships
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
186
+0.38
2.3%
207
+0.13
0.8%
451
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
186
+0.38
0.13
451
+0.13
0.03
270
+0.12
0.08
Negative Logits
>::
-1.51
>();
-1.49
reasonableness
-1.38
varchar
-1.37
testimony
-1.36
incoming
-1.32
quoted
-1.32
validity
-1.31
};
-1.30
latest
-1.27
POSITIVE LOGITS
oler
1.56
olate
1.53
erty
1.52
cler
1.51
hips
1.46
kill
1.44
landers
1.42
\[[
1.41
bos
1.36
[-
1.36
Activations Density 4.531%