INDEX
Explanations
Twitter handles prefixed with '@'
names of people and their social media handles
New Auto-Interp
Negative Logits
wards
-0.67
striking
-0.65
sights
-0.65
notebooks
-0.64
bruising
-0.64
tense
-0.63
fingerprints
-0.62
certificates
-0.62
waivers
-0.62
"â̦
-0.62
POSITIVE LOGITS
_
1.18
Writ
1.08
Jew
1.03
GoldMagikarp
1.00
CBC
0.97
Blog
0.95
Ide
0.92
News
0.91
Reports
0.91
Report
0.87
Activations Density 0.145%