INDEX
    Explanations

    words related to intense emotions, particularly anger

    instances of the word "rage" along with related emotional expressions

    New Auto-Interp
    Negative Logits
    icut
    -0.85
    ramer
    -0.83
    herty
    -0.79
    rica
    -0.71
     coerc
    -0.71
    lder
    -0.68
    nai
    -0.68
     pse
    -0.67
    arent
    -0.67
     Liberties
    -0.66
    POSITIVE LOGITS
    quit
    1.06
     rage
    1.00
     fury
    0.89
     raging
    0.81
     furnace
    0.80
    ï¸
    0.78
    bol
    0.77
    TEXTURE
    0.76
    ously
    0.72
     vengeance
    0.71
    Act Density 0.019%

    No Known Activations