INDEX
Explanations
terms related to social welfare programs, particularly Social Security
mentions of social welfare programs, particularly Social Security
Negative Logits
Äĩ          -0.83
20439       -0.73
nces        -0.72
icular      -0.70
xual        -0.69
needless    -0.67
ridden      -0.67
upon        -0.63
llers       -0.63
pless       -0.62
Positive Logits
ists              0.80
Aff               0.79
Health            0.76
Flow              0.76
Responsibility    0.76
Works             0.74
Security          0.74
Work              0.73
Equality          0.73
Social            0.72
Activations Density 0.010%