INDEX
    Explanations

    website urls at the end of sentences prompting the reader to share or read a story

    phrases indicating options or alternatives

    New Auto-Interp
    Negative Logits
    hower
    -0.83
    steen
    -0.73
    hol
    -0.67
    ouls
    -0.66
    ngth
    -0.64
    ļé
    -0.62
    matter
    -0.60
    puter
    -0.59
    atari
    -0.59
    natureconservancy
    -0.58
    POSITIVE LOGITS
     Share
    0.68
     Submit
    0.66
     Format
    0.65
     subscribe
    0.65
     Paste
    0.64
    ANGE
    0.63
     hear
    0.63
     Comment
    0.63
    yrics
    0.63
    leans
    0.62
    Act Density 0.038%

    No Known Activations