INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     tran
    -0.07
     rápido
    -0.07
    .len
    -0.06
    ningen
    -0.06
    ández
    -0.06
    _Content
    -0.06
    ίνα
    -0.06
    Credential
    -0.06
     تواند
    -0.06
     schn
    -0.06
    POSITIVE LOGITS
     Abu
    0.12
    ame
    0.07
    BSITE
    0.07
     confronting
    0.07
     notorious
    0.07
    dream
    0.07
    ammer
    0.06
    .onreadystatechange
    0.06
     أبو
    0.06
     доктор
    0.06
    Act Density 0.002%

    No Known Activations