INDEX
Explanations
terms related to gender identity and diversity
phrases related to gender identity and expression
New Auto-Interp
Negative Logits
}}}
-0.75
æ©Ł
-0.72
rake
-0.68
Completed
-0.68
Eternity
-0.66
akings
-0.66
hao
-0.65
UNCH
-0.62
Construction
-0.61
çīĪ
-0.61
POSITIVE LOGITS
lesbian
0.95
lesbians
0.95
patriarchy
0.84
feminism
0.82
femin
0.79
marriage
0.79
Lesbian
0.78
feminist
0.78
shaming
0.78
unmarried
0.77
Activations Density 0.420%