INDEX
    Explanations

    punctuation marks, especially commas

    New Auto-Interp
    Negative Logits
     }}$}
    -1.02
    ſelf
    -0.95
     Efq
    -0.95
     myſelf
    -0.94
     Majefty
    -0.90
    ^(@)
    -0.90
     cherchés
    -0.89
    ſelves
    -0.86
     itſelf
    -0.85
     ―――――
    -0.85
    POSITIVE LOGITS
     I
    0.89
     it
    0.80
     you
    0.76
    <eos>
    0.72
     we
    0.72
     If
    0.70
     Do
    0.69
     He
    0.68
    0.67
     The
    0.66
    Act Density 0.155%

    No Known Activations