INDEX
Explanations
strings of text characters that resemble commands or communication in a digital context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
776
+0.14
0.4%
1343
+0.12
0.3%
453
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1786
+0.14
0.03
776
+0.12
0.04
1297
+0.10
0.03
Negative Logits
<bos>
-0.78
RegressionTest
-0.71
OrNil
-0.71
Autoritní
-0.69
verwijspagina
-0.65
ivelany
-0.65
Podob
-0.61
Prí
-0.61
Jako
-0.61
Демографія
-0.60
POSITIVE LOGITS
keramik
1.00
lancia
0.96
maksi
0.92
silikon
0.91
kafe
0.91
melat
0.89
seksi
0.85
akus
0.85
mikrofon
0.85
alkoh
0.84
Activations Density 0.089%