INDEX
    Explanations

    instances of the word "watch" and its variations in different contexts

    New Auto-Interp
    Negative Logits
     Abp
    -0.89
     للمعارف
    -0.84
    ])),
    -0.77
     Damian
    -0.75
    }}],
    -0.72
     Clough
    -0.72
     Merr
    -0.72
    (;;)
    -0.70
     Damien
    -0.69
    ']),
    -0.69
    POSITIVE LOGITS
     watch
    1.71
     WATCH
    1.68
     Watch
    1.65
     watches
    1.57
     Watches
    1.54
    watches
    1.50
    watch
    1.50
    Watch
    1.49
    WATCH
    1.48
     watched
    1.47
    Act Density 0.048%

    No Known Activations