INDEX
Explanations
social media handles or usernames
mentions of Twitter handles
New Auto-Interp
Negative Logits
emonium
-0.73
Vie
-0.70
Rite
-0.69
Reincarn
-0.69
ordinance
-0.67
choir
-0.67
erella
-0.67
Bradford
-0.67
frig
-0.66
pneum
-0.66
POSITIVE LOGITS
realDonaldTrump
1.11
#$
1.07
@@@@@@@@
1.00
0.98
TAG
0.94
0.89
groups
0.87
nat
0.87
link
0.85
wik
0.85
Activations Density 0.021%