INDEX
Explanations
terms related to government social welfare programs, particularly Social Security
references to social security and related policies
New Auto-Interp
Negative Logits
nces
-0.80
oning
-0.71
oned
-0.69
llers
-0.66
ously
-0.64
1001
-0.64
xual
-0.63
landfall
-0.63
peed
-0.63
needless
-0.62
POSITIVE LOGITS
Health
0.81
Insurance
0.77
Flow
0.76
ists
0.75
Welfare
0.74
nect
0.74
ized
0.74
Security
0.73
welfare
0.72
ité
0.71
Activations Density 0.019%