INDEX
Explanations
terms related to social welfare programs
references to social programs, particularly Social Security
Negative Logits
nces                                -0.80
sets                                -0.73
Kinnikuman                          -0.70
xual                                -0.68
1001                                -0.62
rawdownloadcloneembedreportprint    -0.62
20439                               -0.62
lists                               -0.60
ller                                -0.60
umping                              -0.60
Positive Logits
welfare                             0.95
Welfare                             0.88
democr                              0.82
ized                                0.81
istic                               0.79
welf                                0.78
ists                                0.77
Security                            0.77
Justice                             0.77
conservatives                       0.76
Activations Density: 0.025%