INDEX
    Explanations

    references to personal pronouns or reflexive pronouns

    New Auto-Interp
    Negative Logits
     purpoſe
    -1.32
     Majefty
    -1.26
     faſt
    -1.21
     houſe
    -1.18
     raiſ
    -1.17
     pleaſure
    -1.14
     ſtate
    -1.12
     Anſ
    -1.12
     Perſ
    -1.11
     Houſe
    -1.10
    POSITIVE LOGITS
     se
    1.33
     Se
    0.93
     si
    0.90
     es
    0.87
     le
    0.77
     be
    0.77
     je
    0.74
    ../../
    0.72
     s
    0.71
     ab
    0.71
    Act Density 0.028%

    No Known Activations