INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     daher
    -0.08
    Open
    -0.07
     advancing
    -0.06
     clumsy
    -0.06
     conseils
    -0.06
    -0.06
    	mp
    -0.06
    softmax
    -0.06
     South
    -0.06
     branches
    -0.06
    POSITIVE LOGITS
    It
    0.07
    Ticket
    0.07
     installment
    0.07
     내용
    0.07
    it
    0.07
    'It
    0.07
    Pragma
    0.07
    ovit
    0.07
    ytt
    0.07
    iT
    0.07
    Act Density 0.063%

    No Known Activations