INDEX
    Explanations

    references to gender, particularly focusing on men and women

    New Auto-Interp
    Negative Logits
     surla
    -1.08
    endphp
    -0.85
     /\.(
    -0.84
    fjspx
    -0.83
     autorytatywna
    -0.82
    
    -0.82
    ReusableCell
    -0.80
    onViewCreated
    -0.79
     BorderRadius
    -0.75
    jspx
    -0.74
    POSITIVE LOGITS
     xadrez
    0.50
     aprobado
    0.49
     letzten
    0.49
     interessiert
    0.48
     dernières
    0.48
     dedicato
    0.48
     devait
    0.47
     dovuto
    0.47
     correspondiente
    0.46
     Anybody
    0.46
    Act Density 0.020%

    No Known Activations