INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     banho
    -0.08
     sizes
    -0.08
     gobiernos
    -0.07
     Tuhan
    -0.07
     Tus
    -0.07
    Tus
    -0.07
    进去
    -0.07
     Governments
    -0.07
    _UN
    -0.07
    Govern
    -0.07
    POSITIVE LOGITS
     достиг
    0.09
     faced
    0.08
    ellis
    0.08
     climbed
    0.08
    0.08
     પહોંચી
    0.08
     예상
    0.08
     forcing
    0.08
    (pk
    0.07
    0.07
    Act Density 0.001%

    No Known Activations