INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    y
    -0.51
     Autorizaciones
    -0.50
    ності
    -0.49
    ように
    -0.48
     Freitag
    -0.48
     Freitas
    -0.46
     noastră
    -0.46
    substring
    -0.45
     nacido
    -0.45
    ambilan
    -0.45
    POSITIVE LOGITS
    ol
    1.01
    ols
    0.89
    iol
    0.87
    zol
    0.85
    Jol
    0.83
    jol
    0.83
    OL
    0.82
    nol
    0.81
    IOL
    0.78
    tol
    0.77
    Act Density 0.038%

    No Known Activations