INDEX
    Explanations

    phrases that emphasize conditions or requirements for achieving certain outcomes

    New Auto-Interp
    Negative Logits
    essor
    -0.15
     Rocket
    -0.14
    irim
    -0.13
    flen
    -0.13
    elpers
    -0.13
     hardcore
    -0.13
     Strict
    -0.13
    .lp
    -0.13
    ower
    -0.13
    aster
    -0.13
    POSITIVE LOGITS
     accord
    0.18
    edy
    0.18
     red
    0.16
     rid
    0.16
     forest
    0.16
     fashion
    0.15
     wrest
    0.15
     dign
    0.15
     cob
    0.15
    rive
    0.15
    Act Density 0.539%

    No Known Activations