Publication:

Connecting the Dots: Network Testing, Community Estimation, and Genomic Applications

Loading...
Thumbnail Image

Date

2024-05-09

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Cammarata, Louis Vincent. 2024. Connecting the Dots: Network Testing, Community Estimation, and Genomic Applications. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.

Abstract

This thesis presents three self-contained chapters: a global detection test in the mixed membership Stochastic Block Model (SBM), a community estimation algorithm in the dynamic degree-corrected mixed membership SBM, and a network-based study of the transcriptional control of cellular signal sensing.

Detecting the presence of structured communities is one of the most fundamental problems of statistical network analysis. In Chapter 1, we introduce a degree- and cycle count-based test statistic for global testing in the mixed-membership SBM, a common model for social networks. We derive its asymptotic null distribution and show that it is optimal for all choices of model parameters.

Studying the evolving structure of complex dynamic networks is becoming increasingly popular. In Chapter 2, we propose a spectral algorithm for dynamic node embedding and mixed membership estimation. We establish explicit error rates under smoothness assumptions on the temporal evolution of mixed memberships and potentially severe degree heterogeneity, showing that our method is rate optimal across a broad parameter range. We showcase its effectiveness on a trade network and a human contact network.

Network methods provide critical insights in many scientific disciplines, such as genomics. In Chapter 3, we report evidence of an important layer of transcriptional control of cellular signal sensing. Taking adhesion receptors as an example, we apply diverse network and statistical approaches to characterize the link between chromatin organization in the cell nucleus, gene co-regulation, and receptor proteins clustering.

Description

Other Available Sources

Research Data

Keywords

Community estimation, Genomic networks, Global detection, Network science, Statistics, Bioinformatics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories