INDEX
    Explanations
    Negative Logits
    Token           Logit
    " Matthias"     -0.07
    " hil"          -0.07
    " adequate"     -0.06
    " tedbir"       -0.06
    "\tBuffer"      -0.06
    "ольш"          -0.06
    " equalTo"      -0.06
    " Osmanlı"      -0.06
    (not shown)     -0.06
    "mod"           -0.06
    Positive Logits
    Token           Logit
    "-person"       0.09
    " person"       0.08
    " đứ"           0.07
    (not shown)     0.07
    " witnessing"   0.06
    (not shown)     0.06
    ".annotation"   0.06
    ".jsoup"        0.06
    " suburbs"      0.06
    " Sind"         0.06
    Activation Density: 0.002%

    No Known Activations