INDEX
Explanations
numerical values along with some specific names
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.18
0.6%
2019
+0.16
0.5%
381
+0.14
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1678
+0.18
0.05
1343
+0.16
0.05
267
+0.14
0.04
Negative Logits
znacznie
-0.56
głó
-0.56
pelic
-0.55
squire
-0.54
rascal
-0.52
wspania
-0.51
făcut
-0.50
dokład
-0.49
pelican
-0.49
estekak
-0.49
POSITIVE LOGITS
allarg
1.07
apparti
1.03
parlar
1.01
cæ
1.00
Gemeinsame
0.98
potest
0.98
habet
0.94
Cfr
0.94
quæ
0.94
sappi
0.93
Activations Density 0.146%