    Explanations

    phrases containing the possessive pronoun "my" followed by numbers

    Negative Logits
    Token         Logit
    " milf"       -1.26
    " peppa"      -1.19
    " effe"       -1.09
    " madonna"    -1.09
    " fuf"        -1.08
    " excru"      -1.08
    " inappro"    -1.07
    " hentai"     -1.06
    " erad"       -1.06
    " jojo"       -1.04
    Positive Logits
    Token         Logit
    " my"         1.13
    "<bos>"       1.11
    "my"          1.02
    " My"         0.92
    " myself"     0.88
    "My"          0.86
    " MY"         0.81
    " own"        0.81
    " minha"      0.77
    "MY"          0.77
    Act Density 0.159%
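
    An activation density figure like this is usually read as the fraction of token positions on which the feature fires at all. The sketch below illustrates that reading only. The function name `activation_density`, the zero threshold, and the sample data are assumptions made for illustration, not the dashboard's actual computation.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "active" means strictly greater than the threshold (0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 159 active positions out of 100,000 tokens.
acts = [1.0] * 159 + [0.0] * (100_000 - 159)
density = activation_density(acts)
print(f"{density:.3%}")
```

    Under this reading, a density of 0.159% corresponds to roughly 1.6 active positions per 1,000 tokens, so the feature is sparse.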

    No Known Activations