INDEX
    Explanations

    instances of the word "decline" and its variations, indicating a focus on decrease or deterioration

    New Auto-Interp
    Negative Logits
    oso
    -0.08
    izable
    -0.07
    /place
    -0.07
    MOOTH
    -0.07
    feit
    -0.07
    rieve
    -0.07
    ctl
    -0.07
    cela
    -0.07
    nis
    -0.07
    illary
    -0.07
    POSITIVE LOGITS
    posables
    0.08
    afi
    0.07
    otron
    0.06
    acons
    0.06
    MBER
    0.06
    /exp
    0.06
    /de
    0.06
    à¥įद
    0.06
    .scalablytyped
    0.06
    inish
    0.06
    Act Density 0.008%

    No Known Activations