INDEX
Explanations
items of communication-related significance, such as email sign-ups and newsletters
periods at the end of sentences
New Auto-Interp
Negative Logits
acquaintance
-0.69
hood
-0.69
cod
-0.68
fman
-0.68
oun
-0.68
smoked
-0.66
hom
-0.66
altar
-0.66
reversible
-0.65
enumer
-0.65
POSITIVE LOGITS
Subscribe
1.02
Please
0.91
Want
0.90
push
0.89
letters
0.89
Each
0.88
Readers
0.87
avanaugh
0.86
Its
0.85
Visit
0.84
Activations Density 0.196%