INDEX
    Explanations

    Tokens that mark the gradual erosion or diminution of something, such as phrases describing wearing away or chipping away at a person, object, or state.

    Negative Logits
    ез        0.86
    ammed     0.78
    ни        0.77
    als       0.76
    cture     0.75
    aminen    0.75
    ceans     0.73
    akoti     0.73
    चांगली     0.73
    ку        0.72
    Positive Logits
    G         1.06
    T         0.89
    W         0.87
    P         0.86
    H         0.86
    U         0.86
    ON        0.84
    O         0.84
    </h4>     0.81
    l         0.80
    Activation Density: 0.004%

    No Known Activations