INDEX
    Explanations

    instances of the letter 'h' and variations in its usage within text

    New Auto-Interp
    Negative Logits
    831
    -0.17
    erez
    -0.17
    uchos
    -0.16
    idence
    -0.15
    биÑĤ
    -0.15
    ziej
    -0.15
    ило
    -0.15
    hani
    -0.14
    lei
    -0.14
    erne
    -0.14
    POSITIVE LOGITS
    undra
    0.22
    uv
    0.20
    ela
    0.19
    ustr
    0.19
    als
    0.17
    vide
    0.17
    ov
    0.17
    onom
    0.16
    oved
    0.16
    448
    0.16
    Act Density 0.006%

    No Known Activations