INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     imágenes
    -0.08
     сог
    -0.07
     göl
    -0.07
     dejting
    -0.07
     со
    -0.07
     رسول
    -0.06
    .indices
    -0.06
    upal
    -0.06
    CancelButton
    -0.06
    _categories
    -0.06
    POSITIVE LOGITS
    ção
    0.09
    inho
    0.09
    ÇÃO
    0.08
    ário
    0.08
    izacao
    0.08
    ão
    0.08
     Brazilian
    0.08
    iedade
    0.07
     Lima
    0.07
    ÃO
    0.07
    Act Density 0.879%

    No Known Activations