INDEX
Explanations
punctuation marks, particularly commas
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.35
1.6%
2019
+0.16
0.7%
382
+0.13
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
382
+0.35
0.06
1265
+0.16
0.05
2019
+0.13
0.06
Negative Logits
<bos>
-2.86
qiao
-0.67
Paglinawan
-0.63
SPECTION
-0.61
zumal
-0.61
ⓧ
-0.59
springfox
-0.59
ostruct
-0.58
\}=\
-0.58
VERTIS
-0.58
POSITIVE LOGITS
Juf
1.37
accla
1.36
affor
1.24
maneu
1.18
increa
1.18
intrigu
1.13
impra
1.11
reluct
1.10
unspeak
1.10
disgra
1.10
Activations Density 0.333%