Explanations
mentions related to financial terms and government expenses
Neuron Alignment
Index   Value   % of L₁
168     +0.12   0.4%
1557    +0.11   0.3%
1870    +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1557    +0.12      0.02
168     +0.11      0.02
1948    +0.10      0.02
Negative Logits
thut      -0.64
habet     -0.62
afp       -0.61
nomine    -0.60
fatis     -0.60
wien      -0.59
kram      -0.57
xxiv      -0.57
effe      -0.56
myn       -0.56
Positive Logits
taxpayers   1.29
taxpayer    1.25
payers      0.84
tax         0.81
Tax         0.75
payer       0.74
taxes       0.71
TAX         0.69
Tax         0.67
payers      0.66
Activations Density: 0.047%