INDEX
Explanations
references to political party affiliations, particularly focusing on the Republican Party
New Auto-Interp
Negative Logits
bipartisan
-0.19
Democrat
-0.17
Democrats
-0.16
democracy
-0.16
Democratic
-0.16
Liberals
-0.15
diplomacy
-0.15
republican
-0.15
829
-0.15
democratic
-0.14
POSITIVE LOGITS
Party
0.37
-controlled
0.29
Party
0.28
-leaning
0.25
controlled
0.23
-held
0.23
-led
0.21
PARTY
0.21
-dominated
0.20
controlled
0.20
Activations Density 0.029%