INDEX
Explanations
Twitter usernames/accounts
handles or usernames mentioned on Twitter
New Auto-Interp
Negative Logits
ACTIONS
-0.72
fingerprints
-0.68
Islamists
-0.67
striking
-0.65
itably
-0.64
LIMITED
-0.63
sights
-0.62
diapers
-0.62
pickups
-0.62
CONTROL
-0.61
POSITIVE LOGITS
Bow
0.88
rick
0.87
rentice
0.85
Blog
0.83
BB
0.83
Sports
0.81
Movie
0.81
veyard
0.80
yon
0.80
brew
0.79
Activations Density 0.106%