INDEX
Explanations
references to Marxist concepts and principles
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.18
0.8%
604
+0.15
0.7%
872
+0.11
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1253
+0.18
0.04
872
+0.15
0.12
1420
+0.11
0.09
Negative Logits
<bos>
-4.21
don
-0.83
add
-0.82
do
-0.81
,
-0.80
get
-0.79
.
-0.79
are
-0.79
in
-0.77
re
-0.77
POSITIVE LOGITS
napoli
2.18
milano
2.06
santiago
1.98
bandung
1.97
lidl
1.95
stockholm
1.94
ibiza
1.90
pican
1.89
maroc
1.83
ricardo
1.83
Activations Density 2.964%