INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    棋牌
    -0.07
     побач
    -0.07
     کنم
    -0.06
    _lengths
    -0.06
    вания
    -0.06
    ození
    -0.06
     nyní
    -0.06
     گرف
    -0.06
     Dün
    -0.06
     грудня
    -0.06
    POSITIVE LOGITS
     sued
    0.13
     suing
    0.11
     sue
    0.10
     Sue
    0.08
     lawsuit
    0.08
     investigating
    0.07
    .api
    0.07
     asynchronous
    0.07
     overturn
    0.07
    uding
    0.07
    Act Density 0.004%

    No Known Activations