    Explanations

    words related to challenges or difficulty

    instances of the word "tough."

    Negative Logits
    Token         Logit
    "ATURE"       -0.78
    "atern"       -0.77
    "Royal"       -0.73
    "Fac"         -0.73
    "oples"       -0.72
    "uate"        -0.71
    "Footnote"    -0.70
    "uality"      -0.70
    "iliary"      -0.69
    " Veter"      -0.69
    Positive Logits
    Token         Logit
    " cookie"     0.85
    " adolesc"    0.83
    " enough"     0.81
    " hitters"    0.81
    " tough"      0.80
    " obstacles"  0.75
    " hitter"     0.74
    " luck"       0.72
    "shield"      0.72
    "boss"        0.72
    Activation Density: 0.018%

    No Known Activations