INDEX
Explanations
political and social statements related to the Republican party and conservatism
references to political power dynamics and partisanship
Negative Logits
Trave          -0.75
.","           -0.72
)",            -0.71
)."            -0.68
".[            -0.67
.",            -0.64
]."            -0.64
ensu           -0.63
iHUD           -0.63
VERTISEMENT    -0.62
Positive Logits
goddamn        0.96
stupid         0.90
shitty         0.88
fucking        0.84
pissed         0.82
godd           0.81
crappy         0.81
idiots         0.80
dumb           0.80
fucked         0.80
Activations Density: 1.198%