INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     кожи
    -0.07
    _FWD
    -0.06
     insurance
    -0.06
     visc
    -0.06
     fingert
    -0.05
     живот
    -0.05
     mereka
    -0.05
     Insurance
    -0.05
     peptide
    -0.05
     теперь
    -0.05
    POSITIVE LOGITS
    thers
    0.07
    perimental
    0.07
    eşit
    0.07
    (concat
    0.07
    ErrorResponse
    0.07
    utations
    0.07
     TIMES
    0.07
    0.07
    0.07
    atest
    0.06
    Act Density 0.010%

    No Known Activations