INDEX
    Explanations

    instances of the letter "f" in various contexts

    New Auto-Interp
    Negative Logits
    inar
    -0.17
    orest
    -0.17
    ammad
    -0.16
    alim
    -0.16
    ade
    -0.16
    оÑĢ
    -0.15
    raith
    -0.15
    itness
    -0.15
    avor
    -0.15
    erner
    -0.14
    POSITIVE LOGITS
     Dive
    0.19
    ag
    0.17
    defgroup
    0.16
    Ú¯ÛĮ
    0.15
     Hoy
    0.15
    ives
    0.14
    as
    0.14
    rol
    0.14
    IVE
    0.14
    ench
    0.14
    Act Density 0.127%

    No Known Activations