INDEX
    Explanations

    articles and quantifiers indicating quantity or size

    New Auto-Interp
    Negative Logits
    ép
    -0.16
    Ĭ¶
    -0.15
    á»Ļp
    -0.14
    olini
    -0.14
    rupa
    -0.14
     McGr
    -0.14
    èº
    -0.14
    ersen
    -0.13
    lus
    -0.13
    ose
    -0.13
    POSITIVE LOGITS
    isque
    0.17
    heim
    0.16
    acher
    0.14
    ats
    0.13
    inkel
    0.13
    acker
    0.13
    eton
    0.13
    cilik
    0.13
    lsi
    0.13
    ambil
    0.13
    Act Density 0.179%

    No Known Activations