INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Transactional
    -0.07
     prospective
    -0.07
     кого
    -0.07
     wann
    -0.07
     vocalist
    -0.07
     سوی
    -0.07
    -conf
    -0.07
    oun
    -0.06
     drummer
    -0.06
     boon
    -0.06
    POSITIVE LOGITS
    S
    0.09
    Sac
    0.07
    .putInt
    0.07
    DES
    0.07
       
    0.07
    res
    0.07
    ,S
    0.07
    —is
    0.06
    Ps
    0.06
     COPY
    0.06
    Act Density 0.026%

    No Known Activations