INDEX
    Explanations

    terms related to translations and guidelines in various contexts

    Negative Logits
    thers       -0.17
    izu         -0.17
    edBy        -0.16
    иÑģÑĤÑĢа      -0.15
    alez        -0.15
    edis        -0.14
    ãİ          -0.14
    ovenant     -0.14
    kö          -0.14
    ales        -0.14
    Positive Logits
    rell        0.16
    RW          0.15
    ấn          0.14
     ******************************************************************************↵  0.14
     begr       0.14
    613         0.14
    zt          0.14
    ancy        0.14
     Dear       0.14
    Ell         0.13
    Act Density 0.005%

    No Known Activations