Show simple item record

dc.contributor.advisorLange, Christophen_US
dc.contributor.authorYip, Wai-Kien_US
dc.date.accessioned2015-03-18T13:10:36Z
dc.date.created2015-03en_US
dc.date.issued2015-01-13en_US
dc.date.submitted2015en_US
dc.identifier.citationYip, Wai-Ki. 2015. Statistical Methods for Analyzing DNA Methylation Data and Subpopulation Analysis of Continuous, Binary and Count Data for Clinical Trials. Doctoral dissertation, Harvard University, Graduate School of Arts & Sciences.en_US
dc.identifier.urihttp://nrs.harvard.edu/urn-3:HUL.InstRepos:14226106
dc.description.abstractDNA methylation may represent an important contributor to the missing heritability described in complex trait genetics. However, technology to measure DNA methylation has outpaced statistical methods for analysis. Novel methodologies are required to accommodate this growing volume of DNA methylation data. In this dissertation, I propose two novel methods to analyze DNA methylation data: (1) a new statistic based on spatial location information of DNA methylation sites to detect differentially methylated regions in the genome in case and control studies; and (2) a principal component approach for the detection of unknown substructure in DNA methylation data. For each method, I review existing ones and demonstrate the efficacy of my proposed method using simulation and data application. Medical research is increasingly focused on personalizing the care of patients. A better understanding of the interaction between treatment and patient specific prognostic factors will enable practitioners to expand the availability of tailored therapies improving patient outcomes. The Subpopulation Treatment Effect Pattern Plot (STEPP) approach was developed to allow researchers to investigate the heterogeneity of treatment effects on survival outcomes across increasing values of a continuously measured covariate, such as biomarker measurement. I extend the STEPP approach to continuous, binary and count outcomes which can be easily modeled with generalized linear models (GLM). The statistical significance of any observed heterogeneity of treatment effect is assessed using permutation tests. The method is implemented in the R software package (stepp) and is available in R version 3.1.1. The efficacy of my STEPP extension is demonstrated by using simulation and data application.en_US
dc.format.mimetypeapplication/pdfen_US
dc.language.isoenen_US
dash.licenseLAAen_US
dc.subjectBiology, Biostatisticsen_US
dc.subjectBiology, Bioinformaticsen_US
dc.subjectBiology, Geneticsen_US
dc.titleStatistical Methods for Analyzing DNA Methylation Data and Subpopulation Analysis of Continuous, Binary and Count Data for Clinical Trialsen_US
dc.typeThesis or Dissertationen_US
dash.depositing.authorYip, Wai-Kien_US
dc.date.available2015-03-18T13:10:36Z
thesis.degree.date2015en_US
thesis.degree.grantorGraduate School of Arts & Sciencesen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
dc.contributor.committeeMemberLaird, Nanen_US
dc.contributor.committeeMemberGelber, Richarden_US
dc.type.materialtexten_US
thesis.degree.departmentBiostatisticsen_US
dash.identifier.vireohttp://etds.lib.harvard.edu/gsas/admin/view/96en_US
dc.description.keywordsDNA Methylation; spatial location; principal component; Subpopulation analysis; clinical trial; Subpopulation Treatment Effect Pattern Plot (STEPP)en_US
dash.author.emailyuser86@yahoo.comen_US
dash.identifier.drsurn-3:HUL.DRS.OBJECT:25119327en_US
dash.contributor.affiliatedYip, Wai-Ki


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record