INDEX
    Explanations

    references to the word "next" and related notions of progression or sequence in text

    New Auto-Interp
    Negative Logits
    onn
    -0.18
    ampo
    -0.16
    ernity
    -0.16
    vides
    -0.15
    invert
    -0.14
    ยà¸ĩ
    -0.14
    ypse
    -0.14
    hints
    -0.14
    enas
    -0.14
    ITTE
    -0.14
    POSITIVE LOGITS
     article
    0.20
     articles
    0.17
     Lawson
    0.17
    ana
    0.16
     unc
    0.16
    uss
    0.15
    jer
    0.15
    idar
    0.14
    ebra
    0.14
     Articles
    0.14
    Act Density 0.001%

    No Known Activations