INDEX
    Explanations

    phrases indicating challenges or obstacles in various contexts

    New Auto-Interp
    Negative Logits
    kazy
    -0.17
    embr
    -0.15
    zug
    -0.15
    reserve
    -0.15
    ÑĪе
    -0.14
     reserve
    -0.14
    ôt
    -0.14
     Handy
    -0.14
    ιÏĥ
    -0.14
    zip
    -0.13
    POSITIVE LOGITS
     challenge
    0.32
    challenge
    0.27
     task
    0.26
    Challenge
    0.23
     Challenge
    0.23
     goal
    0.23
    task
    0.22
     attempt
    0.22
     aim
    0.21
     tries
    0.21
    Act Density 0.265%

    No Known Activations