INDEX
    Explanations

    variations of the word "some."

    New Auto-Interp
    Negative Logits
    oppers
    -0.17
    eous
    -0.16
    ivot
    -0.15
    overy
    -0.15
    eos
    -0.15
    ymm
    -0.15
    yonel
    -0.14
    yms
    -0.14
    ÑģÑĤа
    -0.14
    sik
    -0.14
    POSITIVE LOGITS
    ewhere
    0.32
    brero
    0.30
    ewhat
    0.29
    erville
    0.27
    erset
    0.26
    etime
    0.26
    mers
    0.25
    thing
    0.23
    ETIME
    0.22
    ber
    0.21
    Act Density 0.008%

    No Known Activations