INDEX
    Explanations

    the word "that" and references to individuals

    Negative Logits
     itſelf       -1.96
     myſelf       -1.89
     Efq          -1.77
     purpoſe      -1.74
     ―――――        -1.74
     Majefty      -1.69
     Monfieur     -1.68
     doubtnut     -1.68
     himſelf      -1.63
     themſelves   -1.61
    Positive Logits
                  1.07
     That         1.07
    That          1.06
    <eos>         0.97
     (            0.95
     что          0.93
     I            0.91
     that         0.85
    .             0.85
    ,             0.85
    Activation Density: 0.118%

    No Known Activations