INDEX
    Explanations

    the word "find" and its variations across different contexts

    New Auto-Interp
    Negative Logits
    odic
    -0.15
    ubern
    -0.15
    er
    -0.15
    von
    -0.14
    vrier
    -0.14
    umm
    -0.14
    sted
    -0.14
    veau
    -0.13
    ongo
    -0.13
    stitial
    -0.13
    POSITIVE LOGITS
    ائÙĦ
    0.15
    änn
    0.15
    touch
    0.15
    661
    0.15
     McN
    0.14
    lernen
    0.14
    ç¾
    0.14
    boot
    0.14
    ÏĦια
    0.14
    iless
    0.14
    Act Density 0.004%

    No Known Activations