INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     País
    -0.08
    多少
    -0.07
     wind
    -0.07
     confined
    -0.07
    portes
    -0.07
     Paul
    -0.07
    weni
    -0.07
     above
    -0.07
     assisting
    -0.07
     చెంద
    -0.07
    POSITIVE LOGITS
     vitória
    0.09
     Conduct
    0.08
     Creat
    0.08
    0.08
    Conduct
    0.08
    сия
    0.08
     steeds
    0.08
     Treat
    0.08
     victory
    0.07
    Hosts
    0.07
    Act Density 0.003%

    No Known Activations