INDEX
    Explanations

    phrases indicating the beginning or initiation of a process or sequence

    phrases starting with "first."

    Negative Logits
    jong     -0.82
    bos      -0.74
    md       -0.72
    tics     -0.71
    ingen    -0.70
    sav      -0.69
    ITH      -0.68
    dain     -0.66
    crim     -0.66
    mbuds    -0.66
    Positive Logits
     thing          1.06
     baseman        1.00
     responders     0.99
     lady           0.93
     installment    0.91
     impression     0.89
     iteration      0.86
     attempt        0.86
     incarnation    0.86
     impressions    0.85
    Activation Density: 0.075%

    No Known Activations