Publication: Learning Low-Dimensional Structures in High Dimensions
Open/View Files
Date
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
Citation
Abstract
Although researchers nowadays deal with increasingly high-dimensional datasets, lower-dimensional structures often exist and learning them effectively play a crucial role. This dissertation contributes to this effort by developing theories and methodologies for learning low dimensional structures in high dimensions, developed by the author and collaborators. The first two chapters focus on learning one-dimensional functionals in the proportional asymptotics setting, where the number of observations grows proportionally with the number of features. The theories developed in this asymptotic regime reveals surprising new phenomena and exhibit remarkable accuracy in finite samples of high dimensions, where classical theories fail. Chapter 1 investigates heritability estimation under this framework. We study the joint distribution of debiased Lasso and Ridge estimators under proportional asymptotics, and consequently propose a heritability estimator that features an adaptive tuning procedure. Empirical studies confirm its efficiency and robustness compared to previous methods. Chapter 2 focus on transfer learning in linear models. Within proportional asymptotics framework, we precisely characterize the generalization error of min-norm interpolators under both design and model shifts, and confirm the finite sample accuracy via simulations. This analysis identifies regimes where transfer learning is beneficial, and provides relevant guidelines for practitioners. Chapter 3 shifts focus to unsupervised learning of high-dimensional Poisson process arrival data. We propose an end-to-end pipeline that not only reconstructs underlying (inhomogeneous) rate functions of Poisson processes, but also learns low-dimensional representations that are valuable for downstream learning tasks.