INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _flash
    -0.09
     Aboriginal
    -0.08
    _el
    -0.08
     lezen
    -0.08
    ترین
    -0.08
    _vect
    -0.08
     faptul
    -0.08
     рассказы
    -0.08
    λύ
    -0.08
     торговли
    -0.07
    POSITIVE LOGITS
     Proposition
    0.08
     proposition
    0.08
     questões
    0.08
     Received
    0.07
    ayment
    0.07
     paquet
    0.07
     Payment
    0.07
     precaution
    0.07
     Sig
    0.07
     PAYMENT
    0.07
    Act Density 0.001%

    No Known Activations