INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _TRANS
    -0.09
    respons
    -0.08
    	stop
    -0.08
    TRIES
    -0.08
     자유
    -0.07
     Bucks
    -0.07
     NSObject
    -0.07
    尽力
    -0.07
    -0.07
     happening
    -0.07
    POSITIVE LOGITS
     LC
    0.07
    ϗ
    0.07
    0.07
     mixing
    0.06
    awan
    0.06
     promotional
    0.06
    込み
    0.06
    0.06
     hoch
    0.06
     PCR
    0.06
    Act Density 0.006%

    No Known Activations