INDEX
    Explanations

    definite articles in various contexts

    New Auto-Interp
    Negative Logits
    et
    -0.18
     pac
    -0.14
    etting
    -0.14
    eka
    -0.14
    ulu
    -0.14
    psilon
    -0.14
    AMP
    -0.14
    FA
    -0.14
    pac
    -0.14
    awa
    -0.13
    POSITIVE LOGITS
    sembl
    0.15
    strap
    0.15
    urge
    0.14
    594
    0.14
    çak
    0.14
    á»įng
    0.13
    853
    0.13
     بداÙĨ
    0.13
    eck
    0.13
    antha
    0.13
    Act Density 0.025%

    No Known Activations