INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ยว
    -0.07
     sanki
    -0.06
     bunun
    -0.06
     hates
    -0.06
    具有
    -0.06
     CHtml
    -0.06
    )(__
    -0.06
    _prepare
    -0.06
    ^\
    -0.06
    (context
    -0.06
    POSITIVE LOGITS
    ewing
    0.07
     incl
    0.07
    Acceleration
    0.07
    intl
    0.06
     expenditure
    0.06
     اجرای
    0.06
    (Rect
    0.06
    BAT
    0.06
    metrics
    0.06
    (@"
    0.06
    Act Density 0.001%

    No Known Activations