    Explanations

    imperative verbs followed by direct objects

    Negative Logits
    Token                  Logit
    " Languages"           -0.65
    "Zen"                  -0.65
    "availability"         -0.63
    "Interstitial"         -0.63
    "inger"                -0.62
    "bage"                 -0.62
    "liest"                -0.60
    "marine"               -0.59
    "DAQ"                  -0.58
    "natureconservancy"    -0.58
    Positive Logits
    Token                  Logit
    "tered"                 1.14
    "tering"                0.95
    "icia"                  0.95
    " us"                   0.92
    " me"                   0.80
    " loose"                0.78
    "itia"                  0.78
    " him"                  0.76
    "ting"                  0.75
    " slip"                 0.73
    Activation Density: 2.857%

    No Known Activations