    Negative Logits
    Token             Logit
    " indication"     -0.08
    " nurse"          -0.07
    " importantly"    -0.07
    " acol"           -0.07
    "上述"            -0.07
    " Escort"         -0.07
    " portions"       -0.07
    " episc"          -0.07
    " Supra"          -0.07
    "downs"           -0.07
    Positive Logits
    Token             Logit
    "/company"        0.09
    " dhow"           0.08
    " Σύ"             0.08
    "issingen"        0.08
    "/The"            0.08
    "्ञ"              0.08
    " homofil"        0.08
    " ktorý"          0.08
    " lemons"         0.08
    " որպեսզի"        0.08
    Activation Density: 0.002%

    No Known Activations