INDEX
    Explanations

    occurrences of the word "Joe"

    the presence of the substring "oe" in words

    New Auto-Interp
    Negative Logits
    ifiers
    -0.84
    ifiable
    -0.81
    ifier
    -0.79
     Ambro
    -0.70
    ifications
    -0.69
    abad
    -0.69
     glim
    -0.68
    ivities
    -0.67
    arians
    -0.67
    rons
    -0.66
    POSITIVE LOGITS
    ppel
    1.23
    zie
    1.10
    zzi
    1.05
    lect
    1.03
    cean
    1.02
    hler
    1.01
    hner
    0.97
    hl
    0.95
    utenant
    0.94
    ffer
    0.94
    Act Density 0.018%

    No Known Activations