INDEX
    Explanations

    references to authors and their contributions in academic texts

    New Auto-Interp
    Negative Logits
    elan
    -0.16
    abra
    -0.16
    usp
    -0.15
     Amend
    -0.15
    аÑĢаÑĤ
    -0.15
    ody
    -0.15
     Premium
    -0.14
    ury
    -0.14
     Zus
    -0.14
    ESCO
    -0.14
    POSITIVE LOGITS
    clado
    0.15
    اث
    0.15
    idge
    0.15
    obao
    0.14
    $__
    0.14
    lichkeit
    0.14
    dül
    0.14
    serir
    0.14
    utdown
    0.14
    iske
    0.14
    Act Density 0.087%

    No Known Activations