INDEX
Explanations
statistics or facts related to gender disparities
occurrences of the word "are" in various contexts related to gender and inequality issues
Negative Logits
xxxxxxxx            -0.72
ulence              -0.71
Telesc              -0.69
inventoryQuantity   -0.68
worthiness          -0.66
bye                 -0.65
osate               -0.64
iversary            -0.64
transpired          -0.63
signifies           -0.62
POSITIVE LOGITS
accustomed          1.11
reluctant           1.10
aware               1.07
willing             1.04
eager               1.04
able                1.00
afraid              1.00
encouraged          1.00
unable              0.99
happiest            0.98
Activations Density: 0.285%