INDEX
Explanations
phrases related to reflection and contemplation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1381
+0.09
0.2%
324
+0.08
0.2%
1745
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1020
+0.09
0.04
586
+0.08
0.03
1001
+0.07
0.03
Negative Logits
fta
-2.38
squa
-2.22
effe
-2.16
increa
-2.16
fte
-2.15
reluct
-2.15
ftu
-2.11
inev
-2.11
unden
-2.11
fto
-2.09
POSITIVE LOGITS
<bos>
1.05
certainly
0.93
definitely
0.82
makeConstraints
0.75
sure
0.69
nor
0.69
at
0.66
do
0.66
openConnection
0.64
however
0.63
Activations Density 0.199%