INDEX
    Explanations

    expressions of regret or missed opportunities

    Negative Logits
    iyim  -0.16
    offee  -0.16
    ammen  -0.15
    xin  -0.14
    shal  -0.14
     unprecedented  -0.14
    adlo  -0.14
    geber  -0.14
    新的  -0.14
     increasingly  -0.13
    Positive Logits
     sooner  0.38
     earlier  0.35
     instead  0.32
    instead  0.27
    もっと  0.26
     Instead  0.24
    Earlier  0.23
     Earlier  0.23
    Instead  0.23
    早  0.22
    Activation Density: 0.234%

    No Known Activations