INDEX
    Explanations

    references to American institutions or organizations

    New Auto-Interp
    Negative Logits
    uru
    -0.18
    alom
    -0.15
     Herrera
    -0.15
     मल
    -0.15
     RectTransform
    -0.15
    pector
    -0.15
    ride
    -0.14
    ĵĺ
    -0.14
    urus
    -0.14
    icha
    -0.14
    POSITIVE LOGITS
    onet
    0.16
    yny
    0.16
     streak
    0.16
    ikan
    0.16
    uguay
    0.15
    strup
    0.15
    ivan
    0.15
    stants
    0.15
     SES
    0.14
    åĽ
    0.14
    Act Density 0.000%

    No Known Activations