Publication:

Bayesian Models to Identify Hidden Patterns with Applications in Biology

Loading...
Thumbnail Image

Date

2022-08-08

Authors

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Yan, Han. 2022. Bayesian Models to Identify Hidden Patterns with Applications in Biology. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.

Abstract

Technology advances have made possible the generation of massive amounts of data in biology. For example, whole genome sequencing (WGS) has made available (nearly) the entirety of DNA sequences of various organisms. Single-cell RNA se- quencing (scRNA-seq) technology measures expression levels of tens of thousands of genes in individual cells. They provide the scientific community huge opportunities as well as challenges in deciphering hidden information and understanding organisms at the micro-level. In this thesis, we will focus on modeling categorical data arise from several domains in biology. We will first introduce a Bayesian method that use aligned sequences from extant species on a species tree to infer DNA substitution rate shift patterns and identify candidate elements associated with a convergent phenotype in the presence of gene tree and species tree discordance. In the second part, we will focus on Bayesian bi-clustering methods, which simultaneous cluster samples and features. We will pro- pose four methods to model categorical data. These methods are designed to tackle different problems in biology, and have increased complexity on how features are mod- eled among object clusters. Though we focus on problems in biology, applications of our methods are broad.

Description

Other Available Sources

Research Data

Keywords

Bayesian Models, Biclustering, Computational biology, Genetics, Phylogenetics, Statistics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories