INDEX
Explanations
mentions of specific names and related actions or interactions on social media
references to public figures and their actions
New Auto-Interp
Negative Logits
âĢij
-0.85
glim
-0.70
©¶æ¥µ
-0.67
Abstract
-0.66
—
-0.65
,—
-0.65
inarily
-0.64
ersen
-0.64
Table
-0.63
Enlarge
-0.62
POSITIVE LOGITS
@
1.13
congr
0.96
DonaldTrump
0.95
Dems
0.93
sic
0.92
#
0.92
pics
0.91
retweet
0.90
OTUS
0.89
ðŁ
0.89
Activations Density 0.437%