INDEX
    Explanations

    headers and section titles in a document

    HTML heading tags (h2, h3, h4)

    New Auto-Interp
    Negative Logits
     ans
    -0.71
     co
    -0.70
     de
    -0.70
     to
    -0.69
     di
    -0.68
     ir
    -0.68
     b
    -0.68
     col
    -0.67
     in
    -0.66
     or
    -0.66
    POSITIVE LOGITS
     itſelf
    1.32
     juſt
    1.20
     greateſt
    1.18
     pleaſure
    1.17
     ſever
    1.16
     myſelf
    1.15
     ſmall
    1.13
     Diſ
    1.12
     leſs
    1.11
     deſt
    1.11
    Act Density 0.065%

    No Known Activations