INDEX
Explanations
social media handles
mentions of social media platforms
New Auto-Interp
Negative Logits
calculus
-0.74
concentration
-0.73
overtime
-0.68
elim
-0.67
abil
-0.65
antit
-0.64
surg
-0.64
weld
-0.63
consequential
-0.63
ĪĴ
-0.62
POSITIVE LOGITS
0.96
Tumblr
0.94
Whats
0.94
Pastebin
0.92
Tumblr
0.86
0.86
MSN
0.80
Refresh
0.80
annels
0.78
Blog
0.77
Activations Density 0.502%