INDEX
    Explanations

    words related to people's names

    the end-of-text token and variations of the name "Alexey."

    New Auto-Interp
    Negative Logits
    mint
    -0.79
    istani
    -0.77
    ULAR
    -0.72
    atoon
    -0.71
    ãĥķãĤ©
    -0.63
    Present
    -0.62
    ivity
    -0.62
    atchewan
    -0.61
    ï
    -0.61
    EED
    -0.61
    POSITIVE LOGITS
    ewitness
    1.11
    kj
    0.86
    outube
    0.80
    er
    0.80
    oshi
    0.79
    ield
    0.78
    giene
    0.78
    alty
    0.75
    estinal
    0.74
    esy
    0.73
    Act Density 0.031%

    No Known Activations