INDEX
    NEGATIVE LOGITS
    -0.08
     Hamb
    -0.08
     ebony
    -0.08
     oxy
    -0.08
     Savannah
    -0.07
    Teste
    -0.07
     beneficiation
    -0.07
     Tus
    -0.07
     오류
    -0.07
     pari
    -0.07
    POSITIVE LOGITS
    " asupra"    0.09
    "няй"        0.08
    "ילה"        0.08
    " poderes"   0.08
    "jte"        0.08
    "lado"       0.08
    "667"        0.08
    "/sw"        0.07
    " filosof"   0.07
    "houd"       0.07
    Act Density 0.002%

    No Known Activations