INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    b
    -0.06
     gm
    -0.06
    Eu
    -0.06
     Irving
    -0.06
     ylabel
    -0.06
    期間
    -0.06
     과정
    -0.06
     ecstasy
    -0.06
    //@
    -0.06
    00
    -0.06
    POSITIVE LOGITS
    uffs
    0.06
     sauces
    0.06
    цип
    0.06
    bag
    0.06
     Heroes
    0.06
    _AN
    0.06
     trước
    0.06
    significant
    0.06
    äch
    0.06
    0.06
    Act Density 0.012%

    No Known Activations