Show simple item record

dc.contributor.authorLin, Daoyingen_US
dc.date.accessioned2014-09-17T17:30:30Z
dc.date.available2014-09-17T17:30:30Z
dc.date.issued2014-09-17
dc.date.submittedJanuary 2013en_US
dc.identifier.otherDISS-12354en_US
dc.identifier.urihttp://hdl.handle.net/10106/24764
dc.description.abstractWith the advances in human genome research, it is now believed that the risks of many complex diseases are triggered by the interplay of genetic susceptibilities and environmental exposures. The population-based case-control study (PBCCS) is widely used to investigate the role of genetic variants and environmental exposures in the etiology of complex diseases. There are numerous ways to implement the selection process of cases and controls. In its simplest form, a simple random sampling (SRS) design is used to choose cases and controls from diseased and disease-free population, respectively. Though SRS is easy to conduct and relevant statistical methodologies are well developed, more sophisticated complex sampling (like stratified, clustered, and multistage sampling) for the selection of cases and/or controls are needed for a number of reasons. First, complex sampling is more time and cost efficient than SRS. Second, representative sample can be chosen by conducting complex sampling and thusbiased selection of cases and/or controls could be avoided. As a result,complex sampling is now being used increasingly in large-scale population-based case-control or cross-sectional genetic association studies.The analysis of complex sampling data, however, requires special attention due to the following reasons. First, varying selection probabilities as well as adjustments for nonresponse and incomplete coverage of the population at risk result in differential population weight for each individual. Secondly, multistage clustered sampling design will induce non-negligible intra-cluster correlation. It has been well recognized that invalid inferences can be drawn if we ignore these two complications. There are very limited literature regarding PBCCS with complex sampling. Therefore there is a need to develop statistical methods for properly addressing those complication induced by complex sampling in genetic association studies.In this dissertation, we propose a series of innovative statistical methods for genetic association studies that account for various sampling designs. Robust variance estimators have been developed using the Taylor Linearization technique to incorporate differential weighting and clustering effect. Monte-Carlo simulation studies are utilized to study the properties of the proposed estimators under various sampling designs. The application of the proposed methods is also illustrated using the U.S. Kidney Cancer Study (USKCS), which is one of the largest PBCSS with genome available so far.en_US
dc.description.sponsorshipLi, Yanen_US
dc.language.isoenen_US
dc.publisherMathematicsen_US
dc.titleHaplotype-based Statistical Inference For Case-control Genetic Association Studies With Complex Samplingen_US
dc.typePh.D.en_US
dc.contributor.committeeChairLi, Yanen_US
dc.degree.departmentMathematicsen_US
dc.degree.disciplineMathematicsen_US
dc.degree.grantorUniversity of Texas at Arlingtonen_US
dc.degree.leveldoctoralen_US
dc.degree.namePh.D.en_US


Files in this item

Thumbnail


This item appears in the following Collection(s)

Show simple item record