INDEX
    Explanations

    occurrences of the word "push" and its variations

    New Auto-Interp
    Negative Logits
    иÑĩно
    -0.17
    adle
    -0.16
    utsch
    -0.16
    eker
    -0.15
    .Xaml
    -0.15
    RESS
    -0.15
    ãģıãĤī
    -0.15
     иÑģполн
    -0.14
    jac
    -0.14
    stown
    -0.14
    POSITIVE LOGITS
    (push
    0.28
    .Push
    0.26
     push
    0.25
    -push
    0.24
     Push
    0.22
    push
    0.21
    pull
    0.21
    Push
    0.21
     pushing
    0.21
    .push
    0.21
    Act Density 0.027%

    No Known Activations