    Negative Logits
    Token             Logit
    (not captured)    -0.07
    " выпуск"         -0.07
    "ACP"             -0.07
    (not captured)    -0.07
    " BindingFlags"   -0.06
    " açıl"           -0.06
    "本身就"          -0.06
    " questo"         -0.06
    " glaring"        -0.06
    " hiçbir"         -0.06
    Positive Logits
    Token             Logit
    "ails"            0.07
    " المواد"         0.07
    ".Code"           0.07
    "DW"              0.07
    "SACTION"         0.07
    " Gut"            0.06
    " средств"        0.06
    " costa"          0.06
    " pessoas"        0.06
    "Strategy"        0.06
    Activation Density: 0.009%

    No Known Activations