INDEX
    Explanations

    words ending in 'es' with high activations

    occurrences of the suffix "es" at the end of words

    New Auto-Interp
    Negative Logits
    è»
    -0.74
    iage
    -0.73
    ItemTracker
    -0.70
    Reviewer
    -0.70
    amental
    -0.69
    GAN
    -0.66
    allery
    -0.66
    =-=-=-=-
    -0.65
    staff
    -0.65
    ONSORED
    -0.64
    POSITIVE LOGITS
    terday
    1.27
    ktop
    1.22
    peed
    1.09
    andro
    1.08
    earch
    1.08
    bians
    1.06
    pec
    0.94
    earchers
    0.94
    ury
    0.90
    leep
    0.89
    Act Density 0.042%

    No Known Activations