INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     kept
    -0.08
     FAIL
    -0.07
     widened
    -0.07
    Palette
    -0.06
     thick
    -0.06
     brittle
    -0.06
     bbw
    -0.06
    .tbl
    -0.06
    .sys
    -0.06
     Mbps
    -0.06
    POSITIVE LOGITS
    oder
    0.07
    сих
    0.07
     resonance
    0.07
    ULSE
    0.07
    ase
    0.07
    elem
    0.06
    ูไ
    0.06
    ні
    0.06
    rear
    0.06
    esser
    0.06
    Act Density 0.002%

    No Known Activations