INDEX
Explanations
references to political figures and their actions or statements
New Auto-Interp
Negative Logits
Conserv
-0.32
conservatism
-0.31
conservatives
-0.30
Conservatives
-0.30
conserv
-0.27
Conservative
-0.27
GOP
-0.25
conservative
-0.24
GOP
-0.24
Republican
-0.23
POSITIVE LOGITS
Democratic
0.48
Dem
0.45
DEM
0.44
progressive
0.43
Democr
0.42
Dem
0.41
æ°ij主
0.40
DEM
0.39
Democratic
0.39
Democrats
0.38
Activations Density 0.206%