INDEX
    Explanations

    movement and transition

    New Auto-Interp
    Negative Logits
    erto
    -0.12
     Hos
    -0.09
    uss
    -0.09
    aida
    -0.09
    illard
    -0.09
    ierz
    -0.08
    587
    -0.08
    à¸Ļà¸Ħ
    -0.08
    aval
    -0.08
     Fro
    -0.08
    POSITIVE LOGITS
     onto
    0.24
     into
    0.18
    onto
    0.16
     INTO
    0.14
     vÃło
    0.14
    åĩºæĿ¥
    0.14
     from
    0.13
     towards
    0.12
    _into
    0.12
     Ont
    0.12
    Act Density 0.075%

    No Known Activations