INDEX
    Explanations

    occurrences of the letter "s" in various forms

    New Auto-Interp
    Negative Logits
    alt
    -0.17
    imd
    -0.17
    oup
    -0.16
    igma
    -0.16
    imen
    -0.15
    aci
    -0.15
    outu
    -0.15
    ampton
    -0.15
    и
    -0.15
    ci
    -0.15
    POSITIVE LOGITS
    iph
    0.20
    ust
    0.19
    ear
    0.18
    icken
    0.17
     tart
    0.17
    uss
    0.17
     af
    0.16
    urch
    0.16
    ough
    0.15
    ells
    0.15
    Act Density 0.039%

    No Known Activations