INDEX
    Explanations

    references to the word "my" or variations of it within the text

    New Auto-Interp
    Negative Logits
     itſelf
    -1.07
    bbene
    -0.97
     raiſ
    -0.94
    InjectAttribute
    -0.90
    abstractmethod
    -0.89
     Monfieur
    -0.89
     vectorielles
    -0.87
     vectorielle
    -0.86
     Houſe
    -0.84
     Efq
    -0.84
    POSITIVE LOGITS
     My
    1.34
     own
    1.26
     MY
    1.18
    My
    1.13
    my
    1.10
     my
    1.02
     HIS
    1.02
    MY
    1.00
    getMy
    0.98
     Her
    0.93
    Act Density 0.079%

    No Known Activations