INDEX
    Explanations

    mentions of the letter 'F' in various contexts

    Negative Logits
    Token             Logit
    "ansa"            -0.20
    " permanent"      -0.17
    "airy"            -0.17
    "ully"            -0.17
    "athers"          -0.16
    "emez"            -0.16
    "resh"            -0.16
    "iona"            -0.15
    "057"             -0.15
    " permanently"    -0.15
    Positive Logits
    Token             Logit
    "yon"             0.19
    "edy"             0.18
    "y"               0.17
    "yh"              0.17
    "etting"          0.17
    "oted"            0.16
    "urf"             0.16
    "illion"          0.16
    "eller"           0.15
    "roud"            0.15
    Activation Density: 0.036%

    No Known Activations