INDEX
    Explanations

    instances of the word "walk" and its variations within the context

    New Auto-Interp
    Negative Logits
    hw
    -0.16
    frei
    -0.15
    eve
    -0.15
     Downs
    -0.15
    ooke
    -0.14
    nici
    -0.14
    lef
    -0.14
    565
    -0.14
    aeper
    -0.14
    amus
    -0.13
    POSITIVE LOGITS
    arella
    0.19
    chedulers
    0.15
    ody
    0.14
    /email
    0.14
    erman
    0.14
    leÅŁik
    0.14
    jišť
    0.14
    adel
    0.14
     Jenn
    0.14
    ีà¸ŀ
    0.14
    Act Density 0.049%

    No Known Activations