INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     sodium
    -0.08
     Coordin
    -0.07
    -0.07
     kapsamında
    -0.07
     Behavioral
    -0.07
     onboard
    -0.07
     shield
    -0.07
     hydration
    -0.07
     kapsam
    -0.07
     Collision
    -0.07
    POSITIVE LOGITS
     setbacks
    0.14
     setback
    0.12
    失败
    0.10
     실패
    0.10
     perseverance
    0.09
     terribly
    0.09
    0.09
     miser
    0.09
     হত
    0.08
     થય
    0.08
    Act Density 0.023%

    No Known Activations