INDEX
    Explanations

    various verb forms and participles related to actions or states

    New Auto-Interp
    Negative Logits
    sWith
    -0.17
    DAQ
    -0.17
    dere
    -0.17
    ADDE
    -0.17
    yme
    -0.15
    ibar
    -0.15
    ansi
    -0.15
    SSERT
    -0.15
    iek
    -0.14
    stoff
    -0.14
    POSITIVE LOGITS
    olan
    0.16
    erty
    0.15
    rap
    0.15
    zan
    0.14
    abh
    0.14
    oor
    0.14
     Mang
    0.13
    æŀļ
    0.13
    жа
    0.13
    ih
    0.13
    Act Density 0.243%

    No Known Activations