INDEX
Explanations
references to animals and their descriptions
Neuron Alignment
Index   Value   % of L₁
293     +0.17   0.9%
507     +0.14   0.8%
47      +0.13   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
81      +0.17      0.12
100     +0.14      0.15
256     +0.13      0.16
Negative Logits
Ĥ¬      -1.93
¾       -1.84
ĸ´      -1.83
ľĵ      -1.79
ľ       -1.79
ı       -1.71
ŀ       -1.66
Ŀ       -1.62
ij      -1.60
¯       -1.59
Positive Logits
trillion    1.46
driven      1.42
ivism       1.31
cible       1.31
composed    1.28
etic        1.27
emig        1.26
ivist       1.25
living      1.25
travelled   1.24
Activations Density: 5.958%