    Explanations

    Instances of the word "shift" and its variations, indicating a focus on changes or transitions.

    Negative Logits
    Token         Logit
    "erate"       -0.16
    "uento"       -0.15
    "ilet"        -0.15
    "lek"         -0.14
    "hide"        -0.14
    "åĤĻ"         -0.14
    "eled"        -0.14
    "ervation"    -0.14
    "hek"         -0.14
    "ITY"         -0.14
    Positive Logits
    Token         Logit
    "shift"       0.21
    " Shift"      0.21
    " shifts"     0.20
    " shift"      0.20
    " sands"      0.19
    "(shift"      0.19
    "SHIFT"       0.19
    "Shift"       0.19
    "-shift"      0.19
    " Hlav"       0.17
    Act Density 0.017%
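
    The page does not define "Act Density". A common reading, assumed here, is the fraction of sampled token positions on which the feature's activation is above zero. Under that assumption, a minimal sketch of the calculation might look like the following. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 2 of which activate the feature.
acts = [0.0] * 9998 + [1.3, 0.4]
density = activation_density(acts)

# Format as a percentage, matching the style of the figure shown on the page.
print(f"Act Density {density:.3%}")
```

    A density as low as the one reported would mean the feature is silent on nearly all tokens, which would be consistent with a feature tied to a single word family.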

    No Known Activations