    Explanations

    instances of the letter 'f' in various contexts

    Negative Logits
    ypy      -0.19
    wert     -0.17
    VERS     -0.16
    ire      -0.15
    vala     -0.15
    itz      -0.15
    oring    -0.15
    throp    -0.15
    PEG      -0.15
    ires     -0.14
    Positive Logits
    ic       0.19
    ase      0.17
    oment    0.16
    iche     0.16
    achu     0.16
    het      0.15
    ilde     0.15
    ision    0.15
    etic     0.15
    idel     0.15
    Activation Density: 0.013%

    No Known Activations