INDEX
    Explanations

    mentions of the word "emails"

    Negative Logits
    iasis           -0.75
    axis            -0.74
    stood           -0.70
     Glasgow        -0.68
    GBT             -0.68
    OV              -0.67
    mbuds           -0.67
    vic             -0.67
    hood            -0.67
     Edinburgh      -0.66
    Positive Logits
     Emails          0.94
     inbox           0.91
     correspondence  0.88
     dumps           0.88
    ileaks           0.88
     exchanged       0.85
     emails          0.85
     messages        0.82
     newsletters     0.82
    archive          0.81
    Act Density: 0.022%

    No Known Activations