INDEX
    Explanations

    instances of the word "fail" and its variations, indicating a focus on failure and its implications

    New Auto-Interp
    Negative Logits
    vale
    -0.15
    ErrMsg
    -0.15
    eters
    -0.15
    etical
    -0.15
    ran
    -0.15
    eting
    -0.15
    onto
    -0.14
    eli
    -0.14
    ally
    -0.14
    ermal
    -0.14
    POSITIVE LOGITS
    afe
    0.24
     miser
    0.20
    ures
    0.20
     spectacular
    0.20
     fail
    0.20
     failures
    0.19
     failing
    0.19
     failure
    0.18
     fails
    0.18
     failed
    0.18
    Act Density 0.032%

    No Known Activations