INDEX
Explanations
terms related to technology and programming concepts
Neuron Alignment
Index    Value    % of L₁
1842     +0.17    0.7%
1871     +0.14    0.6%
678      +0.12    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1871     +0.17       0.08
876      +0.14       0.00
304      +0.12       0.01
Negative Logits
Token            Logit
<bos>            -2.62
can              -0.90
***!             -0.89
ⓧ                -0.89
to               -0.88
/*               -0.87
HasColumnType    -0.87
<eos>            -0.87
in               -0.85
for              -0.85
Positive Logits
Token     Logit
increa    3.41
?...      3.40
fta       3.39
!...      3.35
wien      3.34
affor     3.29
emphat    3.27
effe      3.23
ftu       3.17
aen       3.17
Activations Density: 0.501%