dc.contributor.advisor Janson, Lucas LJ dc.contributor.advisor Liu, Jun JL dc.contributor.author Wang, Wenshuo dc.date.accessioned 2021-07-13T04:57:08Z dc.date.created 2021 dc.date.issued 2021-05-07 dc.date.submitted 2021-05 dc.identifier.citation Wang, Wenshuo. 2021. Overcoming the Curse: Statistical Theory and Methods in High Dimensions. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences. dc.identifier.other 28496851 dc.identifier.uri https://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37368273 * dc.description.abstract Battling the curse of dimensionality is a common theme in modern statistics. This dissertation contributes to this effort by surveying theory and methods in high-dimensinonal statistics developed by the author and collaborators, specifically in the areas of regression and sampling. In regression analysis, statisticians try to relate a response variable Y to a set of potential ex- planatory variables X = (X1,... , Xp), and start by trying to identify variables that contribute to this relationship. In statistical terms, this goal can be posed as trying to identify Xj’s upon which Y is conditionally dependent. Sometimes it is of value to simultaneously test for each j, which is more commonly known as variable selection. Classic techniques fail to accomplish these tasks when the number of features is too large. To address this challenge, the conditional randomization test and model-X knockoffs are two recently proposed methods that respectively per- form conditional independence testing and variable selection using knowledge of X’s distribution. Chapter 1 proposes the first generic knockoff sampling algorithm, which is the point of departure of implementing knockoffs, and investigates its optimality in terms of time complexity. Chapter 2 examines the power of the conditional randomization test and knockoffs in the high-dimensional regime in which the ratio of the dimension p and the sample size n converge to a positive constant. In sampling, statisticians try to draw samples from a target distribution. It is quite demanding to obtain representative samples from a high-dimensional distribution while restricting the sample size at a reasonable level. Sequential Monte Carlo and sequential quasi-Monte Carlo methods are powerful computational tools on this front. Chapter 3 studies these methods and derives new convergence rates in terms of the number of samples n. dc.format.mimetype application/pdf dc.language.iso en dash.license LAA dc.subject Statistics dc.title Overcoming the Curse: Statistical Theory and Methods in High Dimensions dc.type Thesis or Dissertation dash.depositing.author Wang, Wenshuo dc.date.available 2021-07-13T04:57:08Z thesis.degree.date 2021 thesis.degree.grantor Harvard University Graduate School of Arts and Sciences thesis.degree.level Doctoral thesis.degree.name Ph.D. dc.contributor.committeeMember Meng, Xiao-Li XM dc.type.material text thesis.degree.department Statistics dc.identifier.orcid 0000-0002-9896-0688 dash.author.email wenshuowang1997@gmail.com
﻿