INDEX
Explanations
long strings of numbers and mathematical symbols
Neuron Alignment
Index    Value    % of L₁
1967     +0.17    0.5%
478      +0.13    0.4%
1577     +0.12    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1110     +0.17       0.01
514      +0.13       0.01
1434     +0.12       0.01
Negative Logits
pymysql      -0.96
shenan       -0.90
hairc        -0.84
paisley      -0.82
smtplib      -0.80
gaily        -0.78
intersper    -0.77
snoopy       -0.77
apprehen     -0.76
tawny        -0.72
Positive Logits
utop       1.09
alkoh      0.95
pól        0.86
plis       0.83
rú         0.83
diagon     0.83
sement     0.83
kosme      0.83
solidar    0.82
bont       0.82
Activations Density 0.004%