INDEX
    Explanations

    expressions of regret or acknowledgment of previous mistakes

    Negative Logits
     Reſ        -1.28
     Anſ        -1.28
     Houſe      -1.20
     itſelf     -1.19
     ―――――      -1.18
     faſt       -1.18
     houſe      -1.17
     Theſe      -1.17
     pleaſure   -1.16
     Efq        -1.11
    Positive Logits
     noted       1.15
     notes       1.13
     notable     1.09
     Note        1.01
     note        1.00
     Notable     0.97
     Notes       0.91
    Note         0.89
    Notable      0.85
    note         0.79
    Act Density 0.143%

    No Known Activations