INDEX
    Explanations

    personal pronouns related to self-reflection

    references to the pronoun "you."

    Negative Logits
    ¿½
    -0.73
    assembly
    -0.66
     Thomson
    -0.65
    stadt
    -0.60
    20439
    -0.60
    EStream
    -0.58
    pty
    -0.57
     Commerce
    -0.57
     Innocent
    -0.56
    shows
    -0.56
    Positive Logits
    're
    1.80
    've
    1.52
    'll
    1.32
    'd
    1.10
     yourselves
    1.07
     owe
    1.07
     know
    1.06
     are
    1.04
     yourself
    1.02
    ngth
    1.02
    Act Density 0.247%

    No Known Activations