INDEX
Explanations
Twitter handles and hashtags
New Auto-Interp
Negative Logits
worms
-0.81
apers
-0.74
binding
-0.73
backer
-0.72
calcul
-0.71
compens
-0.71
conservancy
-0.70
iculty
-0.68
ordinate
-0.67
ignty
-0.67
POSITIVE LOGITS
————————
1.12
————
0.98
————————————————
0.96
Ibid
0.95
âľ
0.85
âĺ
0.84
Jonathan
0.83
Katherine
0.81
Shaun
0.80
Jonah
0.79
Activations Density 0.518%