INDEX
Explanations
Twitter handles to follow or contact
references to social media handles and following instructions
New Auto-Interp
Negative Logits
forgiven
-0.78
raints
-0.65
forced
-0.64
etheless
-0.63
raped
-0.62
venge
-0.57
bably
-0.56
hindsight
-0.55
ģĸ
-0.54
nai
-0.54
POSITIVE LOGITS
@
1.04
0.99
(@
0.95
0.85
0.85
tweets
0.84
0.83
Website
0.79
edin
0.79
Tweet
0.79
Activations Density 0.035%