    Negative Logits
    Token                   Logit
    " Gamergate"            -0.94
    "ospace"                -0.82
    " advers"               -0.78
    " Telecommunications"   -0.77
    " WATCHED"              -0.76
    "vernment"              -0.74
    " seism"                -0.73
    "pmwiki"                -0.71
    " Personnel"            -0.71
    "GBT"                   -0.70
    Positive Logits
    Token                   Logit
    " flavored"             1.45
    " flavor"               1.33
    " sauce"                1.32
    "cake"                  1.28
    " tasting"              1.27
    " pudding"              1.26
    " flavour"              1.25
    " delicious"            1.24
    " tart"                 1.23
    " butter"               1.22
    Act Density 2.239%

    No Known Activations