INDEX
    Explanations

    words related to negative impacts or harms, particularly with a strong negative effect

    references to negative impacts or setbacks

    New Auto-Interp
    Negative Logits
    iosity
    -0.79
     guiIcon
    -0.72
    cius
    -0.67
    Definition
    -0.67
    rian
    -0.66
    orkshire
    -0.66
    HCR
    -0.65
    afort
    -0.65
    heid
    -0.65
    pora
    -0.64
    POSITIVE LOGITS
    blow
    0.90
    outs
    0.88
     blow
    0.88
     blows
    0.84
     retard
    0.82
    pipe
    0.81
    hole
    0.81
    out
    0.81
    bang
    0.81
     Blow
    0.79
    Act Density 0.012%

    No Known Activations