INDEX
Explanations
mentions of transgender-related terms and issues
references to transgender individuals and issues
New Auto-Interp
Negative Logits
iser
-0.73
RPM
-0.68
ci
-0.68
arm
-0.66
!.
-0.65
gio
-0.65
Bug
-0.64
inia
-0.63
_.
-0.63
Stability
-0.61
POSITIVE LOGITS
transgender
3.62
Transgender
3.05
transsexual
2.55
LGBT
2.05
LGBTQ
2.04
lesbian
1.99
gender
1.94
LGBT
1.91
bisexual
1.88
homosexual
1.78
Activations Density 0.021%