INDEX
    Explanations

    words related to insults or derogatory remarks

    phrases related to insults or derogatory language

    New Auto-Interp
    Negative Logits
    ail
    -0.75
    negie
    -0.73
    ucket
    -0.69
    angler
    -0.67
    ills
    -0.67
    iggle
    -0.66
    rief
    -0.66
    arten
    -0.65
    issue
    -0.65
    ECK
    -0.64
    POSITIVE LOGITS
     insult
    0.97
    ingly
    0.95
     insulted
    0.93
     disrespect
    0.91
     insulting
    0.86
     insults
    0.83
     ridicule
    0.77
    ãĤ¹ãĥĪ
    0.74
    xual
    0.74
     Gaw
    0.73
    Act Density 0.041%

    No Known Activations