INDEX
    Explanations

    LaTeX commands and formatting

    New Auto-Interp
    Negative Logits
     Efq
    -1.32
     iſt
    -1.30
     myſelf
    -1.25
     ―――――
    -1.23
     Houſe
    -1.20
    mybatisplus
    -1.18
     itſelf
    -1.18
     Monfieur
    -1.16
     Theſe
    -1.15
     Shakspeare
    -1.10
    POSITIVE LOGITS
     \
    1.19
    \
    0.96
     $\
    0.85
    <eos>
    0.80
    <tr>
    0.77
    0.77
     {\
    0.76
    $\
    0.75
    {\
    0.72
    ↵↵
    0.71
    Act Density 0.092%

    No Known Activations