INDEX
    Explanations

    non-English text

    New Auto-Interp
    Negative Logits
     uburyo
    -0.08
     opiniões
    -0.08
     imọran
    -0.07
    Ls
    -0.07
    icients
    -0.07
     подходят
    -0.07
    Who's
    -0.07
    ున్నారు
    -0.07
    ثرة
    -0.07
    Does
    -0.07
    POSITIVE LOGITS
     ráð
    0.20
     grein
    0.16
     preuve
    0.13
     kazi
    0.13
     hivyo
    0.12
     yhteisty
    0.12
     amfani
    0.12
     crecer
    0.11
     alsof
    0.11
     שימוש
    0.11
    Act Density 0.014%

    No Known Activations