Publication:
Contextual Learning on Graphs for Precision Medicine

No Thumbnail Available

Date

2024-09-03

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Li, Michelle Min Rui. 2024. Contextual Learning on Graphs for Precision Medicine. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.

Research Data

Abstract

Precision medicine requires reasoning over interconnected data across multiple modalities to tailor medical decisions based on the context of individual patients. Graphs---or networks---are universal descriptors for systems of interacting elements, and deep learning on biomedical graphs has facilitated advancements in medicine, including accelerated disease gene prioritization and drug target identification. However, existing graph-based models are context-free: unable to adjust their outputs based on the contexts in which they operate. This PhD dissertation innovates two fundamental contextual learning algorithms, SHEPHERD and PINNACLE, to tackle medical questions for which patient and cell type contexts, respectively, are important. SHEPHERD addresses the challenge of low sample sizes among rare diseases by infusing patient data with external biomedical knowledge. It considers individual patients as unique subgraphs in a rare disease knowledge graph to learn patient-specific contexts derived from relationships such as genotype-phenotype and disease-gene associations, phenotype ontology, and genetic pathways. SHEPHERD's contextualized patient representations are optimized for multi-faceted rare disease diagnosis: performing causal gene discovery, retrieving "patients-like-me" with the same causal gene or disease, and providing interpretable characterizations of novel disease presentations. PINNACLE leverages cell-type-specific gene expression as well as cellular and tissue organization to resolve the role of a protein depending on the cell type context. It generates unique protein representations for every cell type context using cell-type-specific protein interaction networks constructed from single-cell transcriptomic atlases, and enforces the global organization of these representations with a metagraph of cell type communication and tissue hierarchy. PINNACLE's context-aware protein representations enable the analysis of drug effects across cell type contexts and the prediction of therapeutic targets in a cell-type-specific manner. Overall, this PhD dissertation pioneers the development of contextualized models to empower precision medicine, from rare disease diagnosis to drug discovery.

Description

Other Available Sources

Keywords

contextual learning, drug discovery, graph machine learning, precision medicine, rare disease diagnosis, single-cell transcriptomics, Bioinformatics, Computer science

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories