INDEX
    Explanations

    terms associated with promises, pledges, and commitments

    New Auto-Interp
    Negative Logits
    EndTag
    -0.64
     استنادى
    -0.56
    minecraft
    -0.54
    MEK
    -0.53
     thụ
    -0.53
    ouwd
    -0.53
    nictwa
    -0.52
    irp
    -0.51
    ankar
    -0.51
    ModelAdmin
    -0.50
    POSITIVE LOGITS
     promise
    1.81
     Promise
    1.73
     promised
    1.72
     promises
    1.67
    Promise
    1.64
    promise
    1.61
     Promises
    1.55
     promessa
    1.51
     pledge
    1.51
     pledged
    1.50
    Act Density 0.231%

    No Known Activations