INDEX
Explanations
words related to transgender individuals or issues
references to transgender individuals and related issues
New Auto-Interp
Negative Logits
pload
-0.77
Logged
-0.73
spring
-0.70
steen
-0.69
eer
-0.68
illes
-0.66
Thumbnail
-0.66
akings
-0.64
Indust
-0.63
Rove
-0.63
POSITIVE LOGITS
gender
1.04
bathroom
1.00
sexual
0.94
obic
0.94
bathrooms
0.94
restroom
0.93
identities
0.89
ph
0.83
restrooms
0.83
abled
0.82
Activations Density 0.039%