INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    akable
    -0.76
    aeda
    -0.69
    omaly
    -0.66
    orescent
    -0.64
     awe
    -0.63
     awoken
    -0.62
    semble
    -0.62
    undai
    -0.61
    abiding
    -0.60
     nonviolent
    -0.60
    POSITIVE LOGITS
     Media
    1.00
    Magazine
    0.83
    ileaks
    0.80
    Wire
    0.78
    Net
    0.78
     Networks
    0.78
     Gawker
    0.77
     Magazine
    0.77
     Entertainment
    0.76
    Ins
    0.75
    Act Density 0.005%

    No Known Activations