INDEX
    Explanations

    the letter 'l' in various contexts

    New Auto-Interp
    Negative Logits
    owed
    -0.15
    inea
    -0.15
    imson
    -0.14
    ussen
    -0.14
    afone
    -0.14
    caster
    -0.14
    azzo
    -0.14
    svp
    -0.14
    isher
    -0.14
    innen
    -0.14
    POSITIVE LOGITS
    port
    0.15
    orts
    0.15
     rib
    0.14
    gin
    0.14
    aper
    0.14
    iju
    0.14
    llib
    0.14
     bel
    0.13
    orted
    0.13
     ifndef
    0.13
    Act Density 0.003%

    No Known Activations