INDEX
Explanations
mentions of authorship or copyright information
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.14
0.8%
53
+0.14
0.8%
410
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
172
+0.14
0.04
463
+0.14
0.03
273
+0.12
0.03
Negative Logits
egg
-1.54
imetry
-1.44
etine
-1.40
ovo
-1.39
ectomy
-1.37
spot
-1.37
curative
-1.36
oret
-1.34
osse
-1.34
osite
-1.34
POSITIVE LOGITS
ĥ½
2.50
ģ
2.42
Ķ
2.22
·¸
2.15
¬
2.13
ľĵ
2.11
³
2.10
ĵ
2.08
İ
2.06
¿
2.01
Activations Density 0.073%