    Negative Logits
    Token           Logit
    " confisc"      -0.07
    "RadioButton"   -0.06
    " изготов"      -0.06
    "debian"        -0.06
    "steady"        -0.06
                    -0.06
    "ادل"           -0.06
    "adv"           -0.06
    " ragaz"        -0.06
    "OUSE"          -0.05
    Positive Logits
    Token           Logit
    "WIN"           0.07
    " imperative"   0.07
    "log"           0.07
    "ваются"        0.06
    " merge"        0.06
                    0.06
    " journée"      0.06
    ".Euler"        0.06
    " weigh"        0.06
    " merging"      0.06
    Act Density 0.000%

    No Known Activations