INDEX
Explanations
pronouns related to gendered references, specifically focusing on "he," "she," and their variations
New Auto-Interp
Negative Logits
SequentialGroup
-0.48
encodeWith
-0.47
cherchés
-0.43
uede
-0.41
Autoritní
-0.40
ContextHolder
-0.39
Nimbus
-0.39
'{@-0.38
yntaxException
-0.38
etine
-0.38
POSITIVE LOGITS
Portály
0.58
henkilö
0.52
createState
0.50
henkil
0.50
Volkes
0.47
iemand
0.47
alguien
0.46
__()
0.46
someone
0.45
Geister
0.44
Activations Density 0.222%