INDEX
    Explanations

    instances of the letter 'Y' in various contexts

    Negative Logits
    Token     Logit
    iero      -0.15
    icher     -0.15
    alore     -0.15
    apore     -0.14
    killer    -0.14
    lete      -0.14
    åł        -0.14
    ást       -0.14
    icode     -0.14
    ysts      -0.14
    Positive Logits
    Token     Logit
    ea        0.24
    psilon    0.23
    atra      0.20
    achts     0.20
    acht      0.20
    VES       0.19
    ves       0.19
    ields     0.19
    von       0.18
    ogi       0.18
    Activation Density: 0.040%

    No Known Activations