INDEX
    Explanations

    punctuation marks and structure indicators in the text

    New Auto-Interp
    Negative Logits
    orr
    -0.16
    ucz
    -0.15
    arme
    -0.15
     Posted
    -0.15
    aise
    -0.14
    \$
    -0.14
     \$
    -0.14
    ifo
    -0.14
    orro
    -0.14
    umo
    -0.13
    POSITIVE LOGITS
    \
    0.29
     \
    0.19
    \v
    0.18
    \b
    0.17
    \db
    0.16
    \e
    0.15
    \n
    0.15
    \Url
    0.14
    levator
    0.14
    /REC
    0.14
    Act Density 0.081%

    No Known Activations