INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    in
    0.30
    eers
    0.30
    I
    0.29
    ed
    0.29
    D
    0.28
    다면
    0.28
    ür
    0.27
    0.27
    en
    0.26
    em
    0.26
    POSITIVE LOGITS
     sake
    0.31
     purposes
    0.29
     produksi
    0.24
    ked
    0.23
     repurchase
    0.23
     residentes
    0.22
     production
    0.22
     a
    0.22
     resale
    0.21
     comparison
    0.21
    Act Density 0.353%

    No Known Activations