Explanations
references to a spouse, specifically focusing on the word "wife."
references to the term "wife."
Negative Logits
Token        Logit
etting       -0.75
Flavoring    -0.74
orescent     -0.73
obyl         -0.70
oday         -0.70
umbn         -0.68
anyl         -0.65
ums          -0.65
constitu     -0.64
aneously     -0.63
Positive Logits
Token        Logit
wife         1.21
hood         0.88
cake         0.77
doctor       0.76
wife         0.75
gdala        0.74
cook         0.74
husband      0.73
Maria        0.73
maker        0.73
Activations Density: 0.017%