INDEX
    Explanations

    positive impressions

    New Auto-Interp
    Negative Logits
     العزيز
    -0.10
    rechte
    -0.09
    asteel
    -0.09
     unforgettable
    -0.09
    -lasting
    -0.09
     sufrido
    -0.09
     fesoasoani
    -0.09
    党委
    -0.08
    okuq
    -0.08
     ýü
    -0.08
    POSITIVE LOGITS
     promised
    0.10
     promising
    0.10
     enticing
    0.10
     advertised
    0.10
     reputed
    0.09
     promises
    0.09
     обещ
    0.09
     intriguing
    0.09
     intrigued
    0.08
     кажется
    0.08
    Act Density 0.201%

    No Known Activations