INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     리스트
    -0.07
    _identity
    -0.07
    -0.06
     \|
    -0.06
     kans
    -0.06
     ----
    -0.06
    /
    -0.06
    	cell
    -0.06
    (player
    -0.06
    lazy
    -0.06
    POSITIVE LOGITS
     corres
    0.07
    won
    0.07
    .bid
    0.06
    xBD
    0.06
     فرد
    0.06
     fora
    0.06
     herhangi
    0.06
     gro
    0.06
     DialogResult
    0.06
    0.06
    Act Density 0.004%

    No Known Activations