INDEX
    NEGATIVE LOGITS
    Token               Logit
    " Zhao"             -0.07
    " Tian"             -0.06
    " Certain"          -0.06
    ".MouseDown"        -0.06
    "47"                -0.06
    " что"              -0.06
    "772"               -0.06
    " использования"    -0.06
    " scient"           -0.06
    "(stypy"            -0.06
    POSITIVE LOGITS
    Token               Logit
    " walk"             0.14
    " Walk"             0.13
    " Walker"           0.11
    "walk"              0.11
    " walks"            0.09
    " walked"           0.09
    " walking"          0.09
    "Walk"              0.09
    " Walking"          0.09
    "Walker"            0.08
    Activation Density: 0.027%

    No Known Activations