INDEX
Explanations
links to different Twitter posts
hyperlinks or URLs within text
New Auto-Interp
Negative Logits
remission
-0.83
ĪĴ
-0.75
infertility
-0.71
ŃĶ
-0.70
crib
-0.69
immortality
-0.68
pie
-0.66
livest
-0.66
props
-0.66
rubble
-0.66
POSITIVE LOGITS
hash
1.24
realDonaldTrump
1.14
IJ
0.92
say
0.84
ind
0.79
AIN
0.78
jp
0.77
search
0.77
iam
0.76
/#
0.76
Activations Density 0.022%