    Negative Logits
    " другого"  -1.16
    "別の"  -1.10
    "PARTE"  -1.05
    " Enfin"  -1.05
    "อื่น"  -1.03
    " เก"  -1.02
    "Autores"  -1.01
    "另一個"  -1.01
    " innych"  -0.99
    "Udo"  -0.98

    Positive Logits
    " given"  3.58
    " particular"  2.75
    "given"  2.14
    "Given"  1.82
    " Given"  1.80
    " certain"  1.64
    " determinada"  1.63
    " adott"  1.61
    " determinado"  1.59
    "particular"  1.56

    Activation Density: 0.246%

    No Known Activations