    NEGATIVE LOGITS
    Token           Logit
    "_LIB"          -0.08
    "-dismissible"  -0.08
    "assen"         -0.07
    "_none"         -0.07
    "้อย"           -0.07
    "_flux"         -0.07
    " Rue"          -0.07
    " outbound"     -0.07
    " Dere"         -0.07
    " Helping"      -0.07
    POSITIVE LOGITS
    Token           Logit
    "尺寸"          0.19
    " dimensions"   0.16
    " Dimensions"   0.16
    " размеры"      0.15
    ".width"        0.15
    "Dimensions"    0.15
    ".Width"        0.15
    " dimensiones"  0.15
    "(width"        0.14
    " dims"         0.14
    Activation Density: 0.009%

    No Known Activations