INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     gegen
    -0.07
     allies
    -0.07
     poru
    -0.07
     knot
    -0.06
     occurred
    -0.06
     forged
    -0.06
     spot
    -0.06
    ाख
    -0.06
    _listener
    -0.06
     PROP
    -0.06
    POSITIVE LOGITS
     Лит
    0.07
    (month
    0.06
    ُم
    0.06
    *',
    0.06
     thưởng
    0.06
    0.06
    ер
    0.06
    _long
    0.06
     Credential
    0.06
    출장
    0.06
    Act Density 0.012%

    No Known Activations