INDEX
    Explanations

    words related to negative impact or disruption

    words and phrases associated with causing harm or negative consequences

    New Auto-Interp
    Negative Logits
    "},
    -0.65
     Seeking
    -0.60
     spons
    -0.59
    Activity
    -0.58
    bleacher
    -0.57
    nings
    -0.57
     regards
    -0.56
    ordering
    -0.56
    DragonMagazine
    -0.55
     rooting
    -0.55
    POSITIVE LOGITS
    ively
    1.15
     yourselves
    1.03
     herself
    1.00
     himself
    0.99
     yourself
    0.88
    iously
    0.88
     themselves
    0.88
    ingly
    0.85
    ously
    0.83
     ourselves
    0.83
    Act Density 0.486%

    No Known Activations