INDEX
    Explanations

    instances of the word "watching."

    instances of the word "watching."

    New Auto-Interp
    Negative Logits
    esan
    -0.77
     misunderstanding
    -0.77
    phi
    -0.77
    cision
    -0.73
    orrect
    -0.67
    usable
    -0.66
    interstitial
    -0.66
    apolog
    -0.66
    enture
    -0.64
    oxy
    -0.64
    POSITIVE LOGITS
     Watching
    1.30
     watching
    1.17
     watched
    1.04
     wat
    0.90
    ÃįÃį
    0.89
     watch
    0.88
     watches
    0.85
    tower
    0.84
     Watch
    0.83
    watching
    0.83
    Act Density 0.016%

    No Known Activations