INDEX
    Explanations

    words related to movement and transitions

    New Auto-Interp
    Negative Logits
    ark
    -0.16
    nder
    -0.15
    uilt
    -0.15
    ilder
    -0.14
    bas
    -0.14
    atas
    -0.14
    alo
    -0.14
    kem
    -0.14
     alb
    -0.14
     propos
    -0.14
    POSITIVE LOGITS
     into
    0.32
     onto
    0.30
    into
    0.26
    onto
    0.22
     Into
    0.21
     back
    0.21
     naar
    0.20
     vÃło
    0.19
    à¹Ħà¸Ľà¸¢
    0.19
    Into
    0.19
    Act Density 0.167%

    No Known Activations