INDEX
Explanations
Twitter handles
Twitter handles or mentions
New Auto-Interp
Negative Logits
Vale
-0.73
congreg
-0.72
Presents
-0.68
exorc
-0.68
crossover
-0.67
implanted
-0.66
yeast
-0.65
occas
-0.65
Macedonia
-0.65
concentrating
-0.64
POSITIVE LOGITS
realDonaldTrump
1.25
#$
1.17
wik
0.95
gmail
0.90
dan
0.88
lis
0.86
gs
0.86
jon
0.86
alan
0.83
gary
0.80
Activations Density 0.022%