INDEX
    Explanations

    the word "you" with a strong match

    references to the word "you."

    New Auto-Interp
    Negative Logits
     WATCHED
    -0.63
    ulous
    -0.61
     )]
    -0.58
    ãĤ´ãĥ³
    -0.55
    ccording
    -0.54
    stad
    -0.54
    ãģ®å®
    -0.54
    Government
    -0.53
    âĢİ
    -0.52
    ischer
    -0.51
    POSITIVE LOGITS
     you
    2.68
    you
    2.17
     YOU
    1.94
    You
    1.69
     ya
    1.68
     You
    1.67
     your
    1.63
    YOU
    1.53
     yours
    1.45
     yourself
    1.37
    Act Density 0.291%

    No Known Activations