INDEX
    Explanations

    occurrences of the word "even."

    New Auto-Interp
    Negative Logits
    chwitz
    -0.18
    wort
    -0.16
    chin
    -0.15
    èm
    -0.15
    xies
    -0.15
    conte
    -0.14
    ocaly
    -0.14
    undra
    -0.14
    quate
    -0.14
    dit
    -0.14
    POSITIVE LOGITS
    wel
    0.24
    -handed
    0.19
    ness
    0.19
     slightest
    0.16
     though
    0.16
    flo
    0.16
    quiry
    0.16
    zo
    0.16
    398
    0.15
    Though
    0.15
    Act Density 0.066%

    No Known Activations