INDEX
    Explanations

    instances of the letter 'f' in various contexts

    New Auto-Interp
    Negative Logits
    iska
    -0.18
    ade
    -0.17
    formed
    -0.17
    ero
    -0.15
     fav
    -0.15
    lined
    -0.15
    ruta
    -0.14
    era
    -0.14
     Restoration
    -0.14
    tero
    -0.14
    POSITIVE LOGITS
    ocaly
    0.16
    omor
    0.15
    rena
    0.15
    bidden
    0.14
    andre
    0.14
    Ế
    0.14
     miêu
    0.14
    ibaba
    0.14
    ToProps
    0.14
    teenth
    0.14
    Act Density 0.101%

    No Known Activations