INDEX
Explanations
mentions of tax cuts and related financial terms within a political context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
994
+0.10
0.3%
1385
+0.08
0.2%
549
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1553
+0.10
0.04
919
+0.08
0.02
1948
+0.08
0.04
Negative Logits
secon
-1.74
increa
-1.69
desir
-1.64
volunte
-1.62
embra
-1.62
emphat
-1.62
effe
-1.61
guarante
-1.61
perfet
-1.60
inev
-1.60
POSITIVE LOGITS
for
0.69
für
0.61
для
0.61
voor
0.60
tax
0.60
offered
0.60
granted
0.58
announced
0.57
dla
0.57
from
0.56
Activations Density 0.192%