Explanations
references to childbirth and parenting
Negative Logits
EMU        -0.14
emm        -0.14
ymoon      -0.14
utor       -0.14
ád         -0.14
spouse     -0.14
Burton     -0.14
serter     -0.13
cuckold    -0.13
ạ          -0.13
Positive Logits
girl       0.85
girls      0.81
boy        0.74
Girl       0.73
Girls      0.71
-girl      0.71
boys       0.69
girl       0.66
GIR        0.65
Girl       0.65
Activations Density 0.234%