INDEX
Explanations
phrases related to financial and corporate influence in politics and policy-making
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.17
0.5%
604
+0.09
0.2%
1143
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
198
+0.17
0.07
1143
+0.09
0.03
1360
+0.08
0.04
Negative Logits
<bos>
-0.57
apprehen
-0.56
shenan
-0.51
appreci
-0.48
Rgds
-0.45
ministres
-0.44
willy
-0.44
ineffec
-0.43
exasper
-0.42
chanced
-0.41
POSITIVE LOGITS
soggior
0.76
autunno
0.66
palio
0.66
corrom
0.64
Aéroport
0.63
pranzo
0.62
lusso
0.62
cavallo
0.62
prenota
0.61
Muhamma
0.61
Activations Density 0.449%