INDEX
Explanations
Twitter handles beginning with '@'
mentions of social media handles or tags
Negative Logits
emonium          -0.73
pneum            -0.73
circulation      -0.68
rebell           -0.64
liner            -0.64
Primordial       -0.64
monary           -0.64
Dise             -0.63
contracting      -0.63
Xiang            -0.62
Positive Logits
#$               1.48
@@@@@@@@         1.14
realDonaldTrump  1.09
Home             0.94
aic              0.92
gmail            0.89
home             0.83
TE               0.80
las              0.79
GS               0.78
Activations Density: 0.013%