INDEX
Explanations
websites referenced in tweets
instances of social media interactions represented as URLs
New Auto-Interp
Negative Logits
©¶æ
-0.79
senal
-0.73
tracts
-0.67
captive
-0.66
restoration
-0.66
-+-+
-0.66
civilisation
-0.65
exha
-0.64
morale
-0.64
behavi
-0.64
POSITIVE LOGITS
com
1.35
1.04
org
0.99
twitch
0.97
gov
0.95
0.92
dk
0.91
blogspot
0.88
wordpress
0.88
nl
0.87
Activations Density 0.016%