INDEX
Explanations
mentions of email notifications or registration forms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1757
+0.16
0.8%
1806
+0.14
0.7%
805
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.16
0.04
1896
+0.14
0.03
1870
+0.13
0.01
Negative Logits
<bos>
-1.46
ⓧ
-0.56
MemoryWarning
-0.56
Referencie
-0.55
Iné
-0.54
бъ
-0.54
RectangleBorder
-0.52
<?
-0.52
Vanjske
-0.51
@[+][
-0.51
POSITIVE LOGITS
lele
1.03
chèvre
0.89
Manufact
0.89
thuy
0.86
notification
0.84
sirup
0.84
withal
0.83
Aéroport
0.83
toul
0.82
myn
0.82
Activations Density 0.401%