INDEX
Explanations
politicians and their associated states and positions
political party affiliations, specifically those of senators
New Auto-Interp
Negative Logits
lining
-0.73
strat
-0.71
Ramadan
-0.68
tides
-0.63
saturation
-0.62
atro
-0.61
compilation
-0.61
latable
-0.61
gratification
-0.61
starvation
-0.60
POSITIVE LOGITS
)'
1.12
)]
1.00
)
0.96
),
0.94
)."
0.93
)-
0.89
%)
0.89
.),
0.89
),"
0.88
)),
0.87
Activations Density 0.050%