INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    cised
    -0.69
     Scandinavian
    -0.66
     refriger
    -0.63
    ricular
    -0.62
     circumcision
    -0.61
    vantage
    -0.61
    cision
    -0.60
    minded
    -0.60
    ortium
    -0.60
     Starr
    -0.60
    POSITIVE LOGITS
    storms
    0.95
     hashtag
    0.94
     (@
    0.92
    @@@@@@@@
    0.89
     hasht
    0.88
     "@
    0.87
     feeds
    0.85
    username
    0.84
    Tweet
    0.84
     Username
    0.83
    Act Density 0.367%

    No Known Activations