INDEX
    Explanations

    phrases related to controversies or negative events

    references to secrets or conspiracies involving influential figures or groups

    New Auto-Interp
    Negative Logits
     retained
    -0.72
     outset
    -0.69
    imentary
    -0.68
     centrally
    -0.68
    eper
    -0.68
     supplemented
    -0.64
    isites
    -0.63
    erial
    -0.63
     strengthened
    -0.62
     memorandum
    -0.62
    POSITIVE LOGITS
    âĢ
    1.29
     Elsa
    1.05
     anime
    1.05
     Naruto
    1.04
     Blizz
    1.04
    âľ
    1.03
    Pokemon
    1.02
     ponies
    1.00
     Twitch
    1.00
    Elsa
    0.99
    Act Density 0.879%

    No Known Activations