INDEX
    Explanations

    indicators of demographic information

    New Auto-Interp
    Negative Logits
    lesi
    -0.08
     Nim
    -0.06
    мага
    -0.06
    inem
    -0.06
    anim
    -0.06
     å¹³æĸ¹
    -0.06
    orman
    -0.06
    inaire
    -0.06
    utow
    -0.06
    rani
    -0.06
    POSITIVE LOGITS
    umber
    0.06
     Te
    0.06
     Yak
    0.06
    ãĥĹ
    0.06
    onas
    0.06
    linger
    0.06
    ling
    0.06
     عش
    0.06
    DO
    0.06
    ascimento
    0.06
    Act Density 0.001%

    No Known Activations