INDEX
    Explanations

    articles and determiners indicating specific entities or subjects

    New Auto-Interp
    Negative Logits
    iquer
    -0.16
     unb
    -0.15
    ÅĤaw
    -0.14
    abbo
    -0.14
     onCancelled
    -0.14
    ucher
    -0.13
    orners
    -0.13
    utzer
    -0.13
     célib
    -0.13
    pheric
    -0.13
    POSITIVE LOGITS
    553
    0.14
    лем
    0.14
    rek
    0.14
    uld
    0.14
    269
    0.13
    ůr
    0.13
    kla
    0.13
    alia
    0.13
    irit
    0.13
    kat
    0.13
    Act Density 0.128%

    No Known Activations