INDEX
Explanations
phrases related to LGBTQ+ rights and discrimination
New Auto-Interp
Negative Logits
зулта
-0.43
BagLayout
-0.42
rature
-0.42
betek
-0.41
큼
-0.41
aktur
-0.41
AssemblyCompany
-0.41
ofold
-0.40
CURIAM
-0.40
spesies
-0.40
POSITIVE LOGITS
supposedly
1.06
allegedly
0.92
purported
0.88
supuestamente
0.88
ostensibly
0.84
apparently
0.81
prétend
0.81
apparently
0.76
supposed
0.76
Apparently
0.75
Activations Density 0.907%