INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     qua
    -0.09
     departed
    -0.08
     lava
    -0.08
    ിലുള്ള
    -0.08
     Ses
    -0.08
    truncate
    -0.08
     aprovado
    -0.07
     aprobado
    -0.07
    dd
    -0.07
    Approve
    -0.07
    POSITIVE LOGITS
     xảy
    0.09
    bildung
    0.09
     الإصابة
    0.09
     вероят
    0.08
     예방
    0.08
    Occurs
    0.08
    Occurrence
    0.08
     prevention
    0.08
     propensity
    0.08
    概率
    0.08
    Act Density 0.059%

    No Known Activations