INDEX
    Explanations

    instances of the letter 'y' in various contexts

    New Auto-Interp
    Negative Logits
    t
    -0.28
    ãĥ³
    -0.28
    yı
    -0.28
    on
    -0.26
    à¸ģ
    -0.26
    o
    -0.25
    ar
    -0.25
    i
    -0.25
    k
    -0.24
    h
    -0.23
    POSITIVE LOGITS
    اÙģØªÙĩ
    0.16
    ãĤĭãģ¨
    0.16
    blo
    0.16
    inou
    0.16
    asl
    0.15
    edx
    0.15
    dess
    0.15
    elib
    0.15
    timeofday
    0.15
    ponge
    0.15
    Act Density 0.061%

    No Known Activations