INDEX
    Explanations

    phrases of advice or observations

    the word "you" in various contexts and its associated phrases

    Negative Logits
    Token        Logit
    " è£ıè"      -0.63
    "ãĥ³ãĤ¸"     -0.63
    "pedia"      -0.63
    "ENDED"      -0.61
    "temp"       -0.61
    "icum"       -0.61
    " Verge"     -0.60
    "aiden"      -0.60
    "Joined"     -0.60
    "IME"        -0.59
    Positive Logits
    Token        Logit
    "'re"        1.41
    " gotta"     1.38
    " know"      1.19
    "'ve"        1.18
    " wanna"     1.13
    " guys"      1.11
    " realise"   1.08
    " realize"   1.02
    " cannot"    1.01
    " want"      0.98
    Activation Density: 0.120%

    No Known Activations