INDEX
Explanations
dates and historical events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.11
0.4%
752
+0.11
0.4%
1535
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1034
+0.11
0.04
1334
+0.11
0.04
645
+0.11
0.04
Negative Logits
unspeak
-0.93
overcrow
-0.85
apprehen
-0.82
impelled
-0.78
cushi
-0.77
disagre
-0.77
ineffec
-0.76
withal
-0.74
horrend
-0.73
endeavoured
-0.73
POSITIVE LOGITS
espé
1.30
rè
1.29
meras
1.25
alkoh
1.24
anse
1.22
kosme
1.22
dì
1.18
utop
1.18
dè
1.17
kön
1.16
Activations Density 0.158%