INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     fy
    -0.07
     questo
    -0.06
     resemblance
    -0.06
    άνα
    -0.06
     Буд
    -0.06
    	matrix
    -0.06
     международ
    -0.06
     resembl
    -0.06
    ;k
    -0.06
     şi
    -0.06
    POSITIVE LOGITS
     discipl
    0.07
    apollo
    0.07
     getters
    0.06
    (mail
    0.06
    0.06
    .News
    0.06
    :before
    0.06
    -paper
    0.06
     Telecom
    0.06
     collecting
    0.06
    Act Density 0.004%

    No Known Activations