INDEX
    Explanations

    references to academic or research institutions

    New Auto-Interp
    Negative Logits
    OURS
    -0.17
    Uploaded
    -0.17
    utow
    -0.16
     indeb
    -0.15
    Ñĩного
    -0.15
    HEL
    -0.15
    edBy
    -0.15
     theres
    -0.14
    Äįit
    -0.14
    aylight
    -0.14
    POSITIVE LOGITS
     Against
    0.21
     fur
    0.19
     of
    0.17
     Of
    0.16
     des
    0.16
    /D
    0.16
     fuer
    0.15
    (s
    0.15
     für
    0.15
    /S
    0.15
    Act Density 0.148%

    No Known Activations