INDEX
    Explanations

    instances of the word "there" indicating presence or existence

    New Auto-Interp
    Negative Logits
    ecut
    -0.15
    ruit
    -0.14
    ativo
    -0.14
    neh
    -0.14
    azzi
    -0.13
    icensing
    -0.13
    âce
    -0.13
    ewear
    -0.13
    positor
    -0.13
    ukkit
    -0.13
    POSITIVE LOGITS
     follow
    0.20
     remain
    0.18
     need
    0.18
     reign
    0.18
     lo
    0.17
     lur
    0.17
     interven
    0.16
     lay
    0.16
     dw
    0.16
    окол
    0.16
    Act Density 0.092%

    No Known Activations