INDEX
Explanations
web URLs, particularly those associated with subscriptions and offers
Neuron Alignment
Index   Value   % of L₁
50      +0.23   0.8%
1343    +0.15   0.5%
1842    +0.14   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1343    +0.23      0.11
1978    +0.15      0.08
184     +0.14      0.02
Negative Logits
<bos>             -2.96
/**               -1.01
ⓧ                 -1.01
Personendaten     -0.95
intersper         -0.88
<?                -0.86
Autoritní         -0.86
MessageOf         -0.82
RegressionTest    -0.81
                  -0.79
Positive Logits
optik       1.16
silikon     1.16
antik       1.13
Kategor     1.12
bunda       1.08
akut        1.07
alkoh       1.05
kaos        1.04
keramik     1.03
mikrofon    1.01
Activations Density 0.602%