INDEX
Explanations
Twitter handles
Twitter handles or mentions
New Auto-Interp
Negative Logits
congreg
-0.72
circulation
-0.70
specificity
-0.70
tape
-0.70
ordinance
-0.67
aspir
-0.67
lipstick
-0.66
remake
-0.65
concentrate
-0.65
exhibit
-0.64
POSITIVE LOGITS
realDonaldTrump
1.15
#$
1.03
@@@@@@@@
1.02
thereal
0.96
nat
0.95
groups
0.91
sung
0.91
inside
0.91
BBC
0.91
TAG
0.91
Activations Density 0.019%