INDEX
Explanations
mentions of politicians and political activities
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
204
+0.13
0.4%
1506
+0.12
0.4%
1053
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
204
+0.13
0.04
1506
+0.12
0.04
395
+0.12
0.03
Negative Logits
paula
-0.88
affez
-0.86
increa
-0.85
fortn
-0.81
effe
-0.80
secon
-0.79
sogget
-0.79
purcha
-0.79
volunte
-0.79
scrat
-0.79
POSITIVE LOGITS
politicians
1.11
politician
1.03
politician
0.83
lawmakers
0.74
cillors
0.67
ticians
0.66
legislators
0.65
lawmaker
0.65
políticos
0.64
political
0.62
Activations Density 0.132%