INDEX
    Explanations

    occurrences of the word "grim" or its variations in the context of negative situations

    New Auto-Interp
    Negative Logits
    aurus
    -0.08
    _CLIP
    -0.07
    elian
    -0.07
    oun
    -0.07
    asurer
    -0.07
    ียà¸ļ
    -0.07
    esis
    -0.06
    ylko
    -0.06
    roll
    -0.06
    use
    -0.06
    POSITIVE LOGITS
    linger
    0.08
    ness
    0.08
    elda
    0.08
    dest
    0.07
    aces
    0.07
    eton
    0.07
    acing
    0.07
    ities
    0.07
    lich
    0.06
    shaw
    0.06
    Act Density 0.005%

    No Known Activations