INDEX
Explanations
numerical values related to measurements or amounts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.16
0.8%
1535
+0.16
0.8%
1699
+0.15
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
478
+0.16
0.09
1967
+0.16
0.08
1108
+0.15
0.08
Negative Logits
<bos>
-3.05
ⓧ
-1.40
<?
-1.28
/**
-1.22
intersper
-1.22
-1.13
gratify
-1.03
amass
-0.93
disbur
-0.92
/*!
-0.92
POSITIVE LOGITS
seksi
1.06
silikon
0.93
vasi
0.92
optik
0.91
kafe
0.88
maksi
0.86
mikrofon
0.84
keramik
0.83
corrom
0.83
karton
0.83
Activations Density 0.238%