Explanations
words that start with 's' and are followed by a verb ending in 'ing'
the presence of specific punctuation or symbols
Negative Logits
Token         Logit
Wikileaks     -0.69
hyde          -0.69
unrestricted  -0.63
starters      -0.63
Pharaoh       -0.63
incumbent     -0.62
undecided     -0.62
Cors          -0.60
Sacrament     -0.59
RIP           -0.59
Positive Logits
Token         Logit
outher        1.22
ierra         1.17
idd           1.17
iren          1.17
igma          1.16
uss           1.15
igm           1.15
agging        1.15
ixties        1.15
arah          1.14
Activations Density 0.025%