INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     idéia
    -0.50
    ientras
    -0.49
    utuhkan
    -0.49
    Modelos
    -0.43
     للمعارف
    -0.42
    envolvimento
    -0.42
     asegurado
    -0.42
     khuy
    -0.41
    HostException
    -0.41
     Aún
    -0.41
    POSITIVE LOGITS
     life
    0.65
     Life
    0.62
    life
    0.61
    生活
    0.61
     vida
    0.60
    Life
    0.59
     LIFE
    0.50
     Leben
    0.50
     Living
    0.48
     living
    0.47
    Act Density 0.025%

    No Known Activations