INDEX
Explanations
references to relationships with a female partner
references to girlfriends in various contexts
New Auto-Interp
Negative Logits
uchin
-0.78
inoc
-0.77
omin
-0.76
kefeller
-0.75
Interstitial
-0.74
ocratic
-0.70
constitu
-0.68
denomin
-0.67
anche
-0.66
SPONSORED
-0.66
POSITIVE LOGITS
girlfriend
1.14
gdala
1.07
girlfriend
1.04
boyfriend
0.97
girlfriends
0.90
wife
0.89
mistress
0.76
Isabel
0.75
waitress
0.75
Kate
0.74
Activations Density 0.008%