INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     коль
    -0.07
     Ent
    -0.06
     zejména
    -0.06
     централь
    -0.06
    111
    -0.06
    kont
    -0.06
    ynomials
    -0.06
    ราะ
    -0.06
     vis
    -0.06
    -0.06
    POSITIVE LOGITS
     são
    0.34
     São
    0.14
     serão
    0.09
     foram
    0.07
     Sao
    0.07
    ÃO
    0.07
    sans
    0.06
    PACK
    0.06
    0.06
    0.06
    Act Density 0.004%

    No Known Activations