INDEX
    Explanations

    more negative qualifiers

    New Auto-Interp
    Negative Logits
     φο
    0.41
     фо
    0.41
    めます
    0.37
     בכל
    0.36
    க்கொள்ள
    0.36
     フォ
    0.36
     মন্ত্র
    0.35
     sleeve
    0.34
     bankruptcy
    0.34
    ари
    0.34
    POSITIVE LOGITS
     efectu
    0.48
     outperformed
    0.42
     produzido
    0.41
     produite
    0.41
     confirma
    0.40
     Accurate
    0.40
    证实
    0.40
     causado
    0.39
     უფრო
    0.39
     debido
    0.39
    Act Density 0.000%

    No Known Activations