    Negative Logits
    Token           Logit
    "cidade"        -0.06
    " těl"          -0.06
    " dois"         -0.06
    "iotics"        -0.06
    ".Children"     -0.06
    "_receive"      -0.06
    "mploy"         -0.06
    " tamanho"      -0.06
    ".renderer"     -0.06
    " Reyn"         -0.06
    Positive Logits
    Token           Logit
    "cpt"           0.07
    " yelled"       0.07
    "ELSE"          0.07
    " blitz"        0.07
                    0.07
    "ioned"         0.07
    "Split"         0.06
    " fakt"         0.06
    " Katie"        0.06
    "ounced"        0.06
    Activation Density: 0.001%

    No Known Activations