INDEX
Explanations
keywords related to computer programming and functions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.16
0.5%
1445
+0.14
0.4%
1871
+0.14
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1871
+0.16
0.03
453
+0.14
0.03
1317
+0.14
0.03
Negative Logits
Juf
-1.21
thut
-1.14
fta
-1.08
akut
-1.04
kompres
-1.04
mikrofon
-1.03
silikon
-1.01
eksklu
-0.96
maneu
-0.96
gsx
-0.96
POSITIVE LOGITS
<bos>
1.15
Pozdrawiam
0.75
Roskov
0.74
Autoritní
0.71
pozdrawiam
0.71
Wię
0.70
Zgod
0.68
ější
0.68
spos
0.67
Postup
0.67
Activations Density 0.064%