INDEX
    Explanations

    phrases indicating progression or movement through a sequence of events

    New Auto-Interp
    Negative Logits
     bì
    -0.15
    ç¯
    -0.14
    ourg
    -0.14
    éric
    -0.14
     hindsight
    -0.14
    ноÑĩ
    -0.14
    uze
    -0.13
     Redistributions
    -0.13
    åŀ
    -0.13
     ÏĥÏĦι
    -0.13
    POSITIVE LOGITS
     move
    0.64
     moving
    0.63
     Moving
    0.56
     moved
    0.55
    moving
    0.55
     moves
    0.54
     Move
    0.54
    Moving
    0.53
    move
    0.52
    Move
    0.50
    Act Density 0.167%

    No Known Activations