INDEX
    Explanations

    positive descriptions or acknowledgments of accomplishments

    phrases indicating accomplishment or positive evaluation

    New Auto-Interp
    Negative Logits
    pora
    -0.71
    thur
    -0.64
    soDeliveryDate
    -0.62
    jay
    -0.62
    iframe
    -0.61
    TN
    -0.60
    devices
    -0.60
    letters
    -0.59
     passport
    -0.59
     Issue
    -0.57
    POSITIVE LOGITS
    nered
    0.94
    itud
    0.86
    rand
    0.73
    structed
    0.72
    dden
    0.69
    kered
    0.68
    alian
    0.67
    umenthal
    0.64
    ows
    0.63
    stood
    0.63
    Act Density 0.113%

    No Known Activations