    Explanations

    the word "watch" in various contexts

    mentions of the verb "watch."

    Negative Logits
    Token                  Logit
    "interstitial"         -0.74
    " misunderstanding"    -0.72
    "エル"                 -0.69
    "pse"                  -0.66
    " hemorrh"             -0.66
    "activation"           -0.66
    "phi"                  -0.65
    "ヴァ"                 -0.64
    "xual"                 -0.64
    "ctrl"                 -0.63
    Positive Logits
    Token                  Logit
    "tower"                1.28
    " watch"               1.15
    " Watching"            1.09
    " watches"             1.06
    " Watch"               1.04
    "watch"                1.01
    " watching"            0.94
    "dogs"                 0.91
    "Watch"                0.88
    " watched"             0.88
    Activation Density: 0.014%

    No Known Activations