INDEX
    Explanations

    phrases related to significant changes or actions

    New Auto-Interp
    Negative Logits
     irré
    -0.74
    Jumping
    -0.65
     jumping
    -0.64
    Falling
    -0.61
     constamment
    -0.60
     touching
    -0.60
    Datuak
    -0.60
    jumping
    -0.57
     personlig
    -0.57
     bailando
    -0.56
    POSITIVE LOGITS
     walk
    1.09
     rise
    1.07
     hike
    1.04
     roll
    1.02
     move
    0.99
     climb
    0.98
     turn
    0.94
     drop
    0.92
     visit
    0.90
     push
    0.90
    Act Density 0.522%

    No Known Activations