INDEX
    Explanations

    words related to strong negative sentiments or emotions, particularly hatred

    expressions of dislike or hatred toward various subjects

    Negative Logits
    Token               Logit
    "DragonMagazine"    -1.08
    "igmatic"           -0.93
    "ItemImage"         -0.89
    "OGR"               -0.83
    "aunder"            -0.82
    "enture"            -0.78
    "arov"              -0.78
    "eva"               -0.77
    "akeru"             -0.76
    "aqu"               -0.75
    Positive Logits
    Token               Logit
    "fully"             1.03
    " hated"            0.96
    " wasting"          0.88
    " hate"             0.86
    " Mondays"          0.85
    "lessly"            0.83
    "hate"              0.82
    " hates"            0.77
    " dearly"           0.76
    " bullies"          0.72
    Activation Density: 0.061%

    No Known Activations