INDEX
    Explanations

    words ending with "er"

    New Auto-Interp
    Negative Logits
    пеки
    -0.07
    perience
    -0.07
     seas
    -0.06
    ственное
    -0.06
    _areas
    -0.06
    452
    -0.06
    альное
    -0.06
    Art
    -0.06
     degree
    -0.06
     remains
    -0.06
    POSITIVE LOGITS
    er
    0.18
    ER
    0.16
    ers
    0.14
    ler
    0.13
    ator
    0.12
    ner
    0.12
    cher
    0.12
    aker
    0.12
    or
    0.12
    inger
    0.12
    Act Density 0.627%

    No Known Activations