INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     outreach
    -0.08
    Marketing
    -0.08
     warned
    -0.08
     formative
    -0.08
     தொழ
    -0.07
    ياجات
    -0.07
     Outreach
    -0.07
     chăm
    -0.07
     عملی
    -0.07
    warning
    -0.07
    POSITIVE LOGITS
    _value
    0.10
    0.10
    0.10
     값을
    0.10
    答案
    0.10
     numerator
    0.10
     계산
    0.09
     Numer
    0.09
    0.09
     numer
    0.09
    Act Density 0.104%

    No Known Activations