INDEX
    Explanations

    phrases related to causing negative effects or harm

    phrases related to causality and the effects of actions or events

    New Auto-Interp
    Negative Logits
    stra
    -0.76
    sonian
    -0.70
    zb
    -0.69
     Rated
    -0.67
    ature
    -0.67
     Marketable
    -0.67
    nesday
    -0.67
     Introdu
    -0.66
    itsch
    -0.66
    ramid
    -0.66
    POSITIVE LOGITS
     havoc
    1.47
     headaches
    1.21
     confusion
    1.16
     mayhem
    1.15
     trouble
    1.14
     problems
    1.06
     panic
    1.04
     undue
    1.01
     irreversible
    0.98
     irritation
    0.98
    Act Density 0.054%

    No Known Activations