INDEX
    Explanations

    instances of the word "my."

    New Auto-Interp
    Negative Logits
     itſelf
    -0.78
    ſelves
    -0.67
     faſt
    -0.60
    ſtand
    -0.59
     ſtate
    -0.59
     ſever
    -0.59
    leſs
    -0.57
     ſtand
    -0.57
    ſtance
    -0.56
     houſe
    -0.55
    POSITIVE LOGITS
     my
    1.34
    my
    1.11
    My
    1.09
     My
    1.06
     MY
    0.99
     minha
    0.85
    MY
    0.84
     mijn
    0.84
    getMy
    0.83
    Mijn
    0.79
    Act Density 0.064%

    No Known Activations