INDEX
    Explanations

    occurrences of specific suffixes in words

    New Auto-Interp
    Negative Logits
    št
    -0.15
    olid
    -0.15
    ertype
    -0.15
    ajas
    -0.15
     decent
    -0.15
    je
    -0.14
    umont
    -0.14
    NEL
    -0.14
    logen
    -0.14
     log
    -0.14
    POSITIVE LOGITS
    Binder
    0.17
     Binder
    0.15
    assin
    0.15
    ane
    0.15
    onec
    0.14
    hani
    0.14
    imenti
    0.14
    agine
    0.14
    zano
    0.14
    vana
    0.13
    Act Density 0.000%

    No Known Activations