INDEX
Explanations
mentions of same-sex marriage
references to same-sex relationships and marriage
New Auto-Interp
Negative Logits
landfall
-0.74
arij
-0.65
è£ħ
-0.64
dilig
-0.63
auditory
-0.62
tremend
-0.61
Grizz
-0.61
©¶æ
-0.60
Bei
-0.60
NCT
-0.59
POSITIVE LOGITS
ually
0.99
sex
0.99
apore
0.83
ido
0.82
ercise
0.80
ily
0.78
piring
0.78
imo
0.78
gender
0.77
odus
0.77
Activations Density 0.010%