INDEX
Explanations
mentions of political figures and processes, especially related to elections and campaigns
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.23
0.9%
604
+0.10
0.4%
344
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
344
+0.23
0.03
1862
+0.10
0.07
1887
+0.09
0.06
Negative Logits
<bos>
-3.12
intersper
-1.37
/***
-1.28
encomp
-1.10
quitted
-1.09
gratify
-1.03
endow
-1.03
vainly
-1.01
effectually
-0.98
disagre
-0.97
POSITIVE LOGITS
Demok
0.92
Muhamma
0.90
silikon
0.89
kosme
0.89
mikrofon
0.89
uhr
0.86
kado
0.86
keramik
0.85
optik
0.84
Demokrat
0.83
Activations Density 1.817%