INDEX
Explanations
words related to specific computer commands and instructions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
871
+0.12
0.5%
896
+0.12
0.5%
381
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
896
+0.12
0.03
871
+0.12
0.03
370
+0.12
0.03
Negative Logits
<bos>
-1.08
Eksteraj
-0.68
Referencer
-0.67
عرض
-0.65
GEBURTSDATUM
-0.62
Economía
-0.60
Fordítás
-0.60
Хоро
-0.59
película
-0.58
gekomen
-0.58
POSITIVE LOGITS
wien
1.36
gend
1.33
lele
1.29
mef
1.28
dises
1.27
meis
1.26
bourgeo
1.25
gmbh
1.25
abnorm
1.23
grati
1.23
Activations Density 0.080%