INDEX
Explanations
references to or mentions of women
references to women
New Auto-Interp
Negative Logits
retri
-0.76
addon
-0.70
leon
-0.68
azes
-0.67
sprite
-0.65
Takeru
-0.64
awoken
-0.64
infect
-0.64
flipping
-0.63
emitting
-0.62
POSITIVE LOGITS
Women
3.65
Women
3.31
women
2.54
women
2.43
Woman
2.36
WOM
2.32
Ladies
2.08
Female
1.90
Girls
1.90
Woman
1.88
Activations Density 0.013%