INDEX
Explanations
expressions of support or favor towards various ideas or actions
expressions of support or opposition to various issues or policies
New Auto-Interp
Negative Logits
aunder
-0.67
vity
-0.66
yssey
-0.66
ammy
-0.64
bum
-0.63
nerv
-0.63
inel
-0.62
ixtape
-0.60
aptly
-0.60
TBA
-0.60
POSITIVE LOGITS
passionately
0.81
abortion
0.79
tarian
0.75
endorsing
0.72
unres
0.71
realDonaldTrump
0.71
roud
0.70
cair
0.69
republican
0.68
voting
0.67
Activations Density 0.215%