INDEX
    Explanations

    mathematical expressions or formulas

    New Auto-Interp
    Negative Logits
    es
    -0.68
     C
    -0.66
     on
    -0.66
     Mar
    -0.64
     in
    -0.64
     “
    -0.60
     Ab
    -0.59
    </strong>
    -0.58
    AnchorStyles
    -0.58
     –
    -0.57
    POSITIVE LOGITS
    \[
    1.45
     \[
    1.15
     myſelf
    1.11
     uſ
    1.08
     itſelf
    1.07
     ―――――
    1.07
    \]
    1.05
     Monfieur
    1.05
    awtextra
    1.00
     ſtate
    0.98
    Act Density 0.139%

    No Known Activations