INDEX
    Explanations

    the occurrence of the letter 'f' in the text

    New Auto-Interp
    Negative Logits
    oster
    -0.16
    unger
    -0.14
     Cunning
    -0.14
    arb
    -0.14
    alse
    -0.14
     («
    -0.14
     sidel
    -0.14
    /sidebar
    -0.13
    ILLE
    -0.13
     Stephens
    -0.13
    POSITIVE LOGITS
    982
    0.16
    STALL
    0.15
     Shaft
    0.15
    ouro
    0.15
     ladder
    0.14
    nis
    0.13
    CRET
    0.13
    rag
    0.13
    longleftrightarrow
    0.13
    etic
    0.13
    Act Density 0.018%

    No Known Activations