INDEX
    Explanations

    comparisons and differences

    New Auto-Interp
    Negative Logits
     blant
    -0.08
     calon
    -0.08
     pion
    -0.08
    isitiri
    -0.07
     acud
    -0.07
    among
    -0.07
    Labs
    -0.07
     oni
    -0.07
    Ф
    -0.07
     proclamation
    -0.07
    POSITIVE LOGITS
    区别
    0.10
     separate
    0.10
     अलग
    0.10
     ayrı
    0.09
     અલગ
    0.09
     متفاوت
    0.09
     diferente
    0.09
     distinto
    0.09
     পৃথ
    0.08
     separados
    0.08
    Act Density 0.025%

    No Known Activations