INDEX
Explanations
actions and statements related to decision-making and communication in a political context
New Auto-Interp
Negative Logits
eers
-0.64
Pse
-0.63
their
-0.63
themselves
-0.61
itself
-0.61
guiActiveUnfocused
-0.61
glers
-0.61
alike
-0.59
Fail
-0.59
folk
-0.59
POSITIVE LOGITS
personally
1.12
poke
0.86
"#
0.84
resign
0.80
constituents
0.77
"'
0.74
oan
0.71
"
0.70
realDonaldTrump
0.67
constituent
0.67
Activations Density 0.376%