INDEX
    Explanations

    mentions of specific individuals and their social media activities

    New Auto-Interp
    Negative Logits
    ettes
    -0.17
    illos
    -0.16
    tainment
    -0.16
    íķĺìĭł
    -0.15
    indow
    -0.15
    ÑĢип
    -0.14
    pery
    -0.14
    umph
    -0.14
    ÑĥÑĢа
    -0.14
     ÑĨи
    -0.14
    POSITIVE LOGITS
     throw
    0.21
     IG
    0.20
     Throw
    0.19
     Inst
    0.19
     steam
    0.17
    aylor
    0.17
     throwing
    0.17
     Emm
    0.17
     private
    0.17
     wax
    0.16
    Act Density 0.153%

    No Known Activations