INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    })_{
    -0.75
    limus
    -0.74
     εύ
    -0.74
     للاسماء
    -0.68
    })));
    -0.67
    pendium
    -0.65
     Tare
    -0.65
    Dade
    -0.65
     Idy
    -0.65
     Cakes
    -0.65
    POSITIVE LOGITS
     women
    1.16
     Women
    1.12
     WOMEN
    1.08
    Women
    1.04
    women
    1.03
     woman
    0.98
    WOMEN
    0.98
     Woman
    0.96
     WOMAN
    0.94
    Woman
    0.89
    Act Density 0.049%

    No Known Activations