INDEX
    Explanations

    instances of the word "leap" where the activation value is high

    instances of the word "leap" in various contexts

    New Auto-Interp
    Negative Logits
    essee
    -1.08
    gel
    -0.80
    Interstitial
    -0.78
    ividually
    -0.70
    pmwiki
    -0.67
    ĻĤ
    -0.66
    liction
    -0.66
    gew
    -0.64
    matter
    -0.63
    leanor
    -0.62
    POSITIVE LOGITS
    frog
    1.14
     leaps
    1.02
     leap
    0.79
    olicy
    0.79
    rack
    0.76
     Leap
    0.74
    rers
    0.73
     Rivals
    0.69
    fruit
    0.69
     forward
    0.68
    Act Density 0.017%

    No Known Activations