INDEX
Explanations
mentions of same-sex marriage
phrases and terms related to same-sex marriage
New Auto-Interp
Negative Logits
Tiff
-0.67
almonds
-0.67
Mog
-0.65
IPM
-0.63
Loren
-0.63
brim
-0.63
Tens
-0.62
Wolfgang
-0.62
Bild
-0.58
Rodrigo
-0.58
POSITIVE LOGITS
sex
1.65
gender
1.43
Sex
1.22
origin
1.19
sized
1.13
species
1.10
colored
1.08
sided
1.08
family
1.07
race
1.05
Activations Density 0.011%