INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     аналог
    -0.07
     jus
    -0.07
     Patty
    -0.07
     fractional
    -0.07
     kapas
    -0.07
    odule
    -0.06
     potato
    -0.06
     Pais
    -0.06
     Cory
    -0.06
    _Part
    -0.06
    POSITIVE LOGITS
     Men
    0.14
     men
    0.13
    men
    0.12
    Men
    0.12
     women
    0.10
     Women
    0.10
     MEN
    0.10
     menn
    0.10
    -men
    0.09
    EN
    0.09
    Act Density 0.033%

    No Known Activations