INDEX
    Explanations

    instances of the letter "d" in various forms and contexts

    New Auto-Interp
    Negative Logits
    er
    -0.17
    utherford
    -0.16
    achuset
    -0.15
    ortex
    -0.15
    ISM
    -0.15
    added
    -0.15
    eenth
    -0.15
    bia
    -0.14
    isp
    -0.14
    alendar
    -0.14
    POSITIVE LOGITS
    rex
    0.15
    well
    0.15
    rego
    0.15
    robe
    0.14
    rix
    0.14
    åĭ¢
    0.14
    moon
    0.14
    aron
    0.14
     Investing
    0.14
    rias
    0.13
    Act Density 0.115%

    No Known Activations