INDEX
Explanations
names and proper nouns
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.15
0.4%
1741
+0.09
0.3%
1177
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
981
+0.15
0.03
1348
+0.09
0.03
1517
+0.09
0.02
Negative Logits
ecru
-0.92
luxuriant
-0.79
pyridine
-0.77
tetrach
-0.74
friable
-0.70
calyx
-0.68
cupola
-0.67
mauve
-0.67
mohair
-0.66
annulus
-0.66
POSITIVE LOGITS
kac
0.93
kram
0.91
logis
0.90
bera
0.87
antik
0.87
reger
0.86
simplif
0.85
Kategor
0.85
panik
0.83
glan
0.82
Activations Density 0.087%