    Explanations

    sentences that start with "You know," or similar phrases

    repeated phrases that begin with "You know"

    NEGATIVE LOGITS
    Token         Logit
    "士"          -0.80
    "omal"        -0.76
    "aq"          -0.75
    "entials"     -0.74
    "erity"       -0.73
    "pak"         -0.71
    "rehend"      -0.70
    "ãĤº"         -0.70
    "uscript"     -0.69
    "HL"          -0.67
    POSITIVE LOGITS
    Token         Logit
    " uh"         0.80
    " maybe"      0.79
    " kinda"      0.72
    " anecd"      0.71
    "depending"   0.70
    " sensing"    0.69
    " sort"       0.65
    " whatever"   0.65
    "soType"      0.64
    " yeah"       0.64
    Activation Density: 0.051%

    No Known Activations