INDEX
Explanations
terms related to social welfare and government benefits
references to welfare and its implications
New Auto-Interp
Negative Logits
Brun
-0.72
Sie
-0.68
Ou
-0.66
humidity
-0.62
Hundred
-0.61
Hole
-0.61
ergy
-0.58
deaf
-0.57
gran
-0.57
Grind
-0.56
POSITIVE LOGITS
elfare
1.32
welfare
1.15
entitle
0.91
benefit
0.89
Welfare
0.88
recipients
0.87
odynam
0.82
odynamics
0.80
reform
0.80
tarian
0.76
Activations Density 0.008%