INDEX
    Explanations

    actions or changes related to physical movement or adjustments

    Negative Logits
    Token            Logit
    "translation"    -0.14
    "plits"          -0.14
    "asil"           -0.14
    "onds"           -0.14
    "птом"           -0.13
    "กรรม"           -0.13
    "enna"           -0.13
    "378"            -0.13
    "elas"           -0.13
    "ras"            -0.12
    Positive Logits
    Token            Logit
    "-in"            0.51
    "-In"            0.34
    " inn"           0.33
    "-IN"            0.32
    " into"          0.32
    "-i"             0.30
    "-ins"           0.30
    " ins"           0.30
    " ин"            0.28
    "-ln"            0.27
    Activation Density: 0.087%

    No Known Activations