INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    RESULT
    -0.08
    Lee
    -0.08
     prevailing
    -0.08
    .random
    -0.08
    Ig
    -0.08
    Equip
    -0.07
     ಅವ
    -0.07
    DF
    -0.07
    ius
    -0.07
    Own
    -0.07
    POSITIVE LOGITS
     Cor
    0.08
    Translations
    0.08
     ADM
    0.08
     Supp
    0.08
     inj
    0.08
     Arr
    0.07
     itin
    0.07
    няй
    0.07
     الع
    0.07
    arr
    0.07
    Act Density 0.001%

    No Known Activations