INDEX
Explanations
references to organizations, government programs, and legal terms
Neuron Alignment
Index   Value   % of L₁
1978    +0.14   0.4%
678     +0.12   0.4%
1304    +0.11   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1304    +0.14      0.05
678     +0.12      0.04
636     +0.11      0.04
Negative Logits
<bos>   -0.93
a       -0.74
(       -0.73
y       -0.70
int     -0.69
ten     -0.68
t       -0.68
а       -0.67
from    -0.67
相对    -0.67
Positive Logits
silikon   2.15
alkoh     2.05
kram      2.05
hcm       2.05
aen       2.01
dises     1.99
keramik   1.98
mef       1.96
meis      1.95
fta       1.94
Activations Density: 0.143%