    Negative Logits
    Token           Logit
    ".Member"       -0.07
    " Processes"    -0.06
    "_et"           -0.06
    " Mus"          -0.06
    "_leg"          -0.06
    " Produced"     -0.06
    "ptoms"         -0.06
    "js"            -0.06
    " IDirect"      -0.06
    " tup"          -0.06
    Positive Logits
    Token           Logit
    " можно"        0.13
    " можна"        0.09
    " необходимо"   0.09
    " нельзя"       0.09
    " można"        0.08
    " Можно"        0.08
    " يمكن"         0.07
    "Included"      0.07
    " треба"        0.07
    " солн"         0.07
    Activation Density: 0.019%

    No Known Activations