INDEX
    Explanations

    mathematical symbols and formatting elements

    New Auto-Interp
    Negative Logits
     Efq
    -1.59
     Monfieur
    -1.54
     myſelf
    -1.53
     Theſe
    -1.48
     itſelf
    -1.38
     iſt
    -1.36
     Jefus
    -1.35
     ―――――
    -1.33
     ſeveral
    -1.33
     Houſe
    -1.33
    POSITIVE LOGITS
     \
    1.08
    \
    0.74
     $\
    0.74
     {\
    0.73
     (
    0.73
    <eos>
    0.71
    </tr>
    0.70
     ...
    0.69
    <tr>
    0.68
    .\
    0.68
    Act Density 0.100%

    No Known Activations