INDEX
    Explanations

    the presence of HTML-like or structured markup in a document

    NEGATIVE LOGITS
    Token             Logit
    ")}+\"            -0.56
    "}}+\"            -0.52
    "}}}{\"           -0.51
    "}}=\"            -0.51
    "}))↵"            -0.51
    "}}}{"            -0.50
    "])/"             -0.50
    " Bop"            -0.48
    ")});"            -0.48
    "}}}}"            -0.48
    POSITIVE LOGITS
    Token             Logit
    " nevertheless"   1.41
    " but"            1.34
    " nonetheless"    1.27
    " yet"            1.18
    " Nonetheless"    1.16
    " But"            1.16
    " Nevertheless"   1.15
    "but"             1.12
    "Nonetheless"     1.12
    " however"        1.11
    Activation Density 0.166%

    No Known Activations