INDEX
    Explanations
    Negative Logits
    Token               Logit
     bud                -0.07
    'label              -0.06
     fats               -0.06
    ]+=                 -0.06
     стоит              -0.06
     Ingredients        -0.06
     Cycle              -0.06
     cycle              -0.06
     POW                -0.06
     woke               -0.06
    Positive Logits
    Token               Logit
     değiş              0.07
    ibr                 0.07
     seit               0.07
    Enviar              0.06
                        0.06
    .SystemColors       0.06
     krás               0.06
    _via                0.06
     boş                0.06
    adece               0.06
    Act Density 0.000%

    No Known Activations