INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     readily
    -0.07
     Obt
    -0.07
     interaction
    -0.06
     dend
    -0.06
     книги
    -0.06
     nit
    -0.06
    едини
    -0.06
     caves
    -0.06
    -0.06
     inch
    -0.06
    POSITIVE LOGITS
     Suppress
    0.09
     suppressed
    0.09
     suppress
    0.09
    opsis
    0.08
     suppressing
    0.08
     slump
    0.07
    abol
    0.07
    REV
    0.07
    .EditValue
    0.07
    sass
    0.07
    Act Density 0.010%

    No Known Activations