INDEX
    Explanations

    words related to negative qualities or traits

    Negative descriptors related to unpleasantness or harm

    New Auto-Interp
    Negative Logits
    inet
    -0.86
    ingham
    -0.85
    inoa
    -0.81
    HCR
    -0.79
    produced
    -0.76
    inez
    -0.75
    particip
    -0.74
    issued
    -0.73
    ination
    -0.73
    Particip
    -0.73
    POSITIVE LOGITS
     nasty
    1.25
     earthqu
    0.97
     adolesc
    0.96
     surprises
    0.94
     ugly
    0.87
     spoil
    0.85
     barb
    0.83
     poisonous
    0.77
     beasts
    0.76
     mud
    0.76
    Act Density 0.007%

    No Known Activations