INDEX
    Explanations

    loan and descriptive words

    New Auto-Interp
    Negative Logits
     rồi
    0.43
     tint
    0.41
    0.41
    0.41
    ने
    0.41
     informée
    0.40
    يث
    0.40
     кнопку
    0.40
     polít
    0.40
    க்கொண்டு
    0.39
    POSITIVE LOGITS
    classification
    0.50
    ponsorship
    0.46
    utility
    0.44
    ToReference
    0.44
     अनुच्छेद
    0.43
    elivery
    0.42
    signals
    0.42
    LetterIndex
    0.42
     referral
    0.41
    loan
    0.41
    Act Density 0.002%

    No Known Activations