INDEX
Explanations
references to low-income individuals and families
New Auto-Interp
Negative Logits
jo
-0.16
p
-0.15
ulle
-0.14
.dm
-0.14
enn
-0.14
ially
-0.14
oplast
-0.14
caff
-0.14
Vil
-0.14
ann
-0.14
POSITIVE LOGITS
vang
0.14
"@
0.14
vale
0.14
.ua
0.14
vais
0.14
à¥įतव
0.14
ÛĮرÙĩ
0.14
ÃŃv
0.13
âĸį
0.13
hips
0.13
Activations Density 0.003%