INDEX
    Explanations
    Head Attr Weights
    0:0.09
    1:0.09
    2:0.07
    3:0.17
    4:0.04
    5:0.25
    6:0.03
    7:0.05
    8:0.05
    9:0.03
    10:0.05
    11:0.04
    Negative Logits
     viewership    -2.33
     microphones   -2.32
     Gawker        -2.22
     Yiannopoulos  -2.19
     Voting        -2.18
    itbart         -2.17
     Nielsen       -2.14
     GamerGate     -2.11
     MSNBC         -2.09
     Feinstein     -2.07
    Positive Logits
    iso     2.71
    FIX     2.04
    rip     2.03
    Dutch   1.94
    BUG     1.92
    ISO     1.91
     Tos    1.89
    ',"     1.89
    ++;     1.89
    Patch   1.88
    Act Density 0.035%

    No Known Activations