INDEX
Explanations
tweets mentioning a specific Twitter user
mentions of social media handles
Negative Logits
Token         Logit
Takeru        -0.74
ambul         -0.73
nasal         -0.71
noses         -0.68
Orig          -0.68
relegation    -0.67
Tac           -0.67
cel           -0.66
indemn        -0.65
Penal         -0.65
Positive Logits
Token              Logit
realDonaldTrump    1.56
GOP                1.01
truth              0.87
Real               0.86
White              0.85
abby               0.85
raw                0.84
ny                 0.84
ky                 0.84
Sen                0.84
Activations Density: 0.022%