INDEX
Explanations
concepts related to political and economic issues
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.38
1.5%
184
+0.16
0.6%
1137
+0.11
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.38
0.05
1137
+0.16
0.05
204
+0.11
0.03
Negative Logits
kosme
-0.83
Historie
-0.76
protokol
-0.74
klinik
-0.73
etik
-0.72
mikrofon
-0.72
grafik
-0.70
ikon
-0.69
koordin
-0.69
Kombin
-0.69
POSITIVE LOGITS
unspeak
1.46
apprehen
1.22
intersper
1.21
shenan
1.19
indescri
1.19
indestru
1.18
inconce
1.17
impelled
1.16
ardour
1.10
pamph
1.09
Activations Density 0.287%