INDEX
    Explanations

    news headlines or article titles that prompt the reader to "Read more."

    instances of the word "Read," indicating sources or references for further information

    New Auto-Interp
    Negative Logits
    xon
    -0.82
    opard
    -0.70
    IDS
    -0.70
    IDA
    -0.67
    ounty
    -0.65
    TEXTURE
    -0.65
    OPER
    -0.64
    adish
    -0.64
    uay
    -0.64
    ascal
    -0.63
    POSITIVE LOGITS
     aloud
    0.98
    sburg
    0.93
    Write
    0.86
    iness
    0.83
    ahead
    0.83
     Read
    0.83
    just
    0.80
    gon
    0.80
    ying
    0.78
    scl
    0.78
    Act Density 0.014%

    No Known Activations