INDEX
    Explanations

    Gendered pronouns, specifically "he," "she," and their variations

    Negative Logits
    Token               Logit
    SequentialGroup     -0.48
    encodeWith          -0.47
     cherchés           -0.43
    uede                -0.41
    Autoritní           -0.40
    ContextHolder       -0.39
     Nimbus             -0.39
     '{@                -0.38
    yntaxException      -0.38
    etine               -0.38
    Positive Logits
    Token               Logit
    Portály             0.58
     henkilö            0.52
     createState        0.50
     henkil             0.50
     Volkes             0.47
     iemand             0.47
     alguien            0.46
    __()                0.46
     someone            0.45
     Geister            0.44
    Activation Density: 0.222%

    No Known Activations