INDEX
    Explanations

    words related to a decrease, decline, or struggle in various contexts

    references to failure or decline in various contexts

    New Auto-Interp
    Negative Logits
    ourced
    -0.77
    spring
    -0.75
    arily
    -0.72
    ifications
    -0.70
    ifact
    -0.70
    ends
    -0.69
    arte
    -0.69
    arity
    -0.68
    lander
    -0.68
    alde
    -0.67
    POSITIVE LOGITS
    cester
    1.20
    pless
    0.78
    ãĥĥãĥī
    0.78
    ãĥ³ãĤ¸
    0.77
    ggle
    0.69
    riers
    0.68
    à¸
    0.68
    agles
    0.67
    cemic
    0.66
    cest
    0.66
    Act Density 0.019%

    No Known Activations