    NEGATIVE LOGITS
    Token            Logit
    " Depending"     -1.06
    " Because"       -0.96
    " Dispens"       -0.90
    " Deploy"        -0.90
    " Derived"       -0.89
    " During"        -0.87
    "Deploy"         -0.85
    " Deployment"    -0.85
    " Debt"          -0.85
    " Depression"    -0.85
    POSITIVE LOGITS
    Token            Logit
    " due"           2.56
    " du"            1.18
    " dus"           1.14
    " دو"            1.10
    " debido"        1.08
    "Due"            1.05
    " dua"           1.05
    " duo"           1.05
    "由于"           1.02
    " devido"        0.99
    Activation Density: 0.090%

    No Known Activations