INDEX
Explanations
terms related to taxation and fiscal policies
New Auto-Interp
Negative Logits
\Component
-0.18
aight
-0.15
BUF
-0.15
vey
-0.15
eliac
-0.14
icer
-0.13
yon
-0.13
ÏĦιÏĥ
-0.13
862
-0.13
ott
-0.13
POSITIVE LOGITS
orra
0.16
ê»
0.15
IRC
0.15
ools
0.15
cq
0.15
iage
0.15
iani
0.14
ertest
0.14
uet
0.14
ç©į
0.14
Activations Density 0.006%