INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     movement
    -2.77
     Movement
    -2.66
    movement
    -2.56
    Movement
    -2.42
     MOVEMENT
    -2.31
     Movements
    -2.22
     movements
    -2.20
     movimiento
    -1.97
     Bewegung
    -1.97
    movements
    -1.96
    POSITIVE LOGITS
     of
    0.79
     in
    0.68
     to
    0.64
     for
    0.62
     on
    0.58
    ,
    0.58
     (
    0.55
     and
    0.54
     d
    0.52
    .
    0.51
    Act Density 0.105%

    No Known Activations