INDEX
Explanations
terms and phrases related to LGBTQ+ issues and identities, particularly focusing on discrimination and representation
New Auto-Interp
Head Attr Weights
0:0.10
1:0.03
2:0.21
3:0.10
4:0.08
5:0.06
6:0.02
7:0.03
8:0.05
9:0.20
10:0.06
11:0.02
Negative Logits
BIL
-1.36
��極
-1.27
Completed
-1.24
Rated
-1.23
FUL
-1.21
��
-1.19
��
-1.19
alion
-1.17
��
-1.17
LESS
-1.14
POSITIVE LOGITS
eno
1.26
atheist
1.22
bia
1.20
marriage
1.19
arre
1.14
minster
1.13
enclave
1.12
reb
1.12
assert
1.12
orno
1.09
Activations Density 0.043%