INDEX
Explanations
scientific journal citations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
283
+0.17
0.5%
906
+0.13
0.4%
1741
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
283
+0.17
0.01
924
+0.13
0.02
718
+0.11
0.02
Negative Logits
sculptured
-1.14
boughs
-1.12
unspeak
-1.09
tupperware
-1.05
cushi
-1.04
shenan
-0.93
gaily
-0.93
fringed
-0.91
McLaugh
-0.91
hairc
-0.91
POSITIVE LOGITS
Kategor
1.73
alkoh
1.68
elek
1.67
stoff
1.62
makro
1.60
antik
1.59
marte
1.59
fré
1.57
Strukt
1.54
biograf
1.53
Activations Density 0.077%