INDEX
    Explanations

    directives or suggestions prompting action

    phrases instructing the reader to take specific actions or follow advice

    New Auto-Interp
    Negative Logits
    deen
    -0.65
    vell
    -0.64
    Ö¼
    -0.63
    ELD
    -0.63
     territ
    -0.62
     satisf
    -0.55
    Hum
    -0.54
    uko
    -0.52
    onew
    -0.52
    zb
    -0.51
    POSITIVE LOGITS
     to
    0.87
     yourself
    0.83
     bookmark
    0.79
     checking
    0.78
     yourselves
    0.78
     Nanto
    0.76
     scrolling
    0.73
     checkout
    0.70
    assetsadobe
    0.70
     downloading
    0.68
    Act Density 0.099%

    No Known Activations