INDEX
    Explanations

    references to media content or messaging

    references to media content

    New Auto-Interp
    Negative Logits
     Salvador
    -0.80
    adian
    -0.76
     damn
    -0.75
     goddamn
    -0.71
    orney
    -0.70
     polyg
    -0.69
     pencil
    -0.65
    pired
    -0.63
    clus
    -0.61
     Alfred
    -0.61
    POSITIVE LOGITS
    Media
    1.26
     playback
    0.86
    media
    0.85
     VIDEOS
    0.85
    Wiki
    0.82
    Buzz
    0.82
    Sources
    0.80
    fax
    0.79
     Reporter
    0.77
    eval
    0.76
    Act Density 0.011%

    No Known Activations