    Explanations

    phrases indicating the beginning or starting of something

    phrases indicating the initiation or progression of events or conditions

    Negative Logits
    Token               Logit
    " oversaw"          -0.70
    " Palest"           -0.65
    "CI"                -0.65
    " avoided"          -0.64
    "done"              -0.64
    " cares"            -0.63
    "keeping"           -0.62
    " supervised"       -0.61
    " congratulated"    -0.60
    "didn"              -0.60
    Positive Logits
    Token               Logit
    " crumble"          1.30
    " emerge"           1.24
    " unfold"           1.22
    " fade"             1.20
    " explode"          1.13
    " dawn"             1.12
    " trickle"          1.11
    " sink"             1.11
    " dissolve"         1.08
    " fray"             1.08
    Activation Density: 0.162%

    No Known Activations