INDEX
    Explanations

    email-related text

    Negative Logits
    Token               Logit
    "bats"              -0.87
    "SPONSORED"         -0.74
    " cabinets"         -0.71
    "nels"              -0.68
    "soDeliveryDate"    -0.66
    "Stud"              -0.64
    "abouts"            -0.64
    "hend"              -0.64
    "OPS"               -0.63
    "kees"              -0.62
    Positive Logits
    Token               Logit
    " occurred"         0.71
    " Cancel"           0.70
    " allegation"       0.63
    " inacc"            0.63
    "]}"                0.63
    " Error"            0.62
    " omission"         0.59
    " error"            0.58
    " Invalid"          0.58
    " Failed"           0.57
    Activation Density: 5.272%

    No Known Activations