INDEX
    Explanations

    articles and determiners associated with nouns

    New Auto-Interp
    Negative Logits
    elah
    -0.15
    omu
    -0.15
    awan
    -0.15
    bow
    -0.14
    ensen
    -0.14
    ouri
    -0.14
    idi
    -0.14
    ecx
    -0.14
    yan
    -0.13
    à¥ĭध
    -0.13
    POSITIVE LOGITS
    ODY
    0.16
    ниÑĤ
    0.15
     جدا
    0.14
     Lone
    0.14
     founding
    0.13
    åħµ
    0.13
    구
    0.13
    åĿĢ
    0.13
    /MPL
    0.13
    cae
    0.13
    Act Density 0.014%

    No Known Activations