INDEX
Explanations
mentions of spouses or partners, specifically focusing on husbands
mentions of the term "husband."
New Auto-Interp
Negative Logits
obyl
-0.72
XP
-0.70
Flavoring
-0.70
yss
-0.69
handc
-0.68
skating
-0.64
contam
-0.64
govtrack
-0.62
UGE
-0.62
anche
-0.61
POSITIVE LOGITS
husband
1.03
husband
0.96
friend
0.90
hood
0.84
wife
0.83
pins
0.80
fian
0.74
mate
0.74
bed
0.74
nesday
0.73
Activations Density 0.009%