INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Sachs
    -0.07
     stainless
    -0.06
     clinical
    -0.06
     Tunisia
    -0.06
     aroma
    -0.06
     zi
    -0.06
     blanks
    -0.06
     racial
    -0.06
     kendine
    -0.06
    ,const
    -0.06
    POSITIVE LOGITS
    noop
    0.15
     noop
    0.14
     NOP
    0.14
    nop
    0.13
    NOP
    0.12
     nop
    0.12
    _NOP
    0.10
    ноп
    0.07
    0.06
    ("-",
    0.06
    Act Density 0.001%

    No Known Activations