INDEX
    Explanations

    action verbs related to business, development, and functionality

    New Auto-Interp
    Negative Logits
    their
    -0.18
    sg
    -0.17
    ox
    -0.17
    oth
    -0.17
    eb
    -0.17
    ews
    -0.16
    och
    -0.16
    Ñı
    -0.16
    the
    -0.16
    they
    -0.15
    POSITIVE LOGITS
    cales
    0.17
     itself
    0.17
    uars
    0.17
    lick
    0.16
    rase
    0.15
    beg
    0.15
    heets
    0.15
    dete
    0.14
    PECIAL
    0.14
    LAY
    0.14
    Act Density 0.815%

    No Known Activations