INDEX
    Explanations

    the word "except" and its variations, indicating exclusions or exceptions

    New Auto-Interp
    Negative Logits
    coni
    -0.21
    kir
    -0.15
    486
    -0.15
    kola
    -0.14
    ause
    -0.14
     (
    -0.14
    izza
    -0.14
     Lair
    -0.14
    FIX
    -0.14
    ftar
    -0.14
    POSITIVE LOGITS
    ing
    0.37
    ting
    0.19
    ING
    0.19
    s
    0.17
    ed
    0.17
    ingly
    0.16
    reme
    0.15
    antly
    0.15
    edException
    0.15
    eur
    0.15
    Act Density 0.011%

    No Known Activations