INDEX
    Explanations

    phrases or words related to delayed or postponed events

    variable-length substrings or prefixes that could represent various forms of ongoing action or condition

    New Auto-Interp
    Negative Logits
    bots
    -0.61
     ethics
    -0.58
     squats
    -0.58
    NESS
    -0.58
     manipulation
    -0.57
    ĺħ
    -0.56
     shaming
    -0.55
    SIZE
    -0.55
    _-
    -0.54
    DM
    -0.54
    POSITIVE LOGITS
    ested
    0.91
    ited
    0.89
    erest
    0.86
    ivated
    0.84
    istant
    0.83
    aunted
    0.82
    vered
    0.79
    oved
    0.79
    oried
    0.79
    ained
    0.78
    Act Density 0.131%

    No Known Activations