INDEX
    Explanations

    occurrences of the letter 'y' in various contexts

    Negative Logits
    Token     Logit
    i         -0.32
    o         -0.31
    a         -0.29
    r         -0.27
    t         -0.24
    Axis      -0.20
    e         -0.20
    olated    -0.20
    y         -0.20
    auss      -0.20
    Positive Logits
    Token     Logit
    achts     0.20
    tics      0.17
    á»ĥm      0.17
    ea        0.17
    nothrow   0.16
    outu      0.16
    ernel     0.16
    oked      0.16
    ester     0.16
    amaha     0.15
    Act Density 0.061%

    No Known Activations