INDEX
Explanations
mentions of the word "transgender" in the text
references to transgender individuals and issues related to their rights
New Auto-Interp
Negative Logits
Manufacturer
-0.74
steen
-0.74
fare
-0.73
spring
-0.71
gio
-0.67
rpm
-0.67
Kers
-0.66
Rove
-0.65
hower
-0.65
Lans
-0.64
POSITIVE LOGITS
gender
1.15
transgender
0.98
transsexual
0.97
sexual
0.94
genital
0.88
genders
0.85
gender
0.84
restroom
0.81
bathroom
0.80
ethnic
0.80
Activations Density 0.012%