INDEX
Explanations
terms related to transgender individuals
mentions of transgender individuals and related topics
New Auto-Interp
Negative Logits
spring
-0.91
Trust
-0.67
gio
-0.66
fare
-0.66
cv
-0.65
Lans
-0.65
eon
-0.65
steen
-0.64
EY
-0.64
zz
-0.64
POSITIVE LOGITS
transgender
1.22
gender
1.10
transsexual
1.09
Transgender
1.02
gender
0.91
genital
0.87
Nadu
0.84
istani
0.80
Dys
0.79
females
0.79
Activations Density 0.012%