    Explanations

    flaws in, how vulnerable

    Negative Logits
    " somewhat"  0.41
    " довольно"  0.40
    "いくつかの"  0.37
    ";,"  0.34
    " miscellaneous"  0.34
    " several"  0.33
    " également"  0.32
    " relatively"  0.31
    " ',',"  0.31
    "あるいは"  0.31
    Positive Logits
    " usato"  0.40
    " morrer"  0.38
    " usadas"  0.38
    " usamos"  0.37
    " সরাসরি"  0.37
    " زیرمه"  0.37
    " nostre"  0.37
    " crisi"  0.37
    " humanidad"  0.36
    " شرطونو"  0.36
    Act Density 0.046%

    No Known Activations