INDEX
    Explanations

    occurrences of the letter "d" in various contexts

    New Auto-Interp
    Negative Logits
    ed
    -0.27
    ا
    -0.25
    ag
    -0.24
    ë¡ľ
    -0.24
    를
    -0.24
    et
    -0.23
    it
    -0.23
    B
    -0.22
    ont
    -0.22
    im
    -0.22
    POSITIVE LOGITS
    etection
    0.19
    iesel
    0.16
    ëĵĿ
    0.15
    iameter
    0.15
     Dane
    0.15
     hacks
    0.15
    oub
    0.15
    emon
    0.15
    داÙħ
    0.15
    ual
    0.15
    Act Density 0.050%

    No Known Activations