INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    antaranya
    -0.60
     manguera
    -0.52
    hésite
    -0.50
     gynhyrchwyd
    -0.49
     húmedo
    -0.47
     antaranya
    -0.46
    empêcher
    -0.46
     defend
    -0.45
     templado
    -0.44
     représenter
    -0.43
    POSITIVE LOGITS
     arrival
    2.11
     Arrival
    1.98
    Arrival
    1.85
    arrival
    1.79
     arrivals
    1.60
     Arrivals
    1.36
     chegada
    1.24
    arrivée
    1.23
     llegada
    1.20
    Arri
    1.20
    Act Density 0.004%

    No Known Activations