Person: Chang, Hsiao-Han
Email Address
AA Acceptance Date
Birth Date
Research Projects
Organizational Units
Job Title
Last Name
First Name
Name
Search Results
Publication Modeling malaria genomics reveals transmission decline and rebound in Senegal
(Proceedings of the National Academy of Sciences, 2015) Daniels, Rachel; Schaffner, Stephen; Wenger, Edward A.; Proctor, Joshua L.; Chang, Hsiao-Han; Wong, Wesley; Baro, Nicholas; Ndiaye, Daouda; Fall, Fatou Ba; Ndiop, Medoune; Ba, Mady; Milner, Danny; Taylor, Terrie E.; Neafsey, Daniel; Volkman, Sarah; Eckhoff, Philip A.; Hartl, Daniel; Wirth, DyannTo study the effects of malaria-control interventions on parasite population genomics, we examined a set of 1,007 samples of the malaria parasite Plasmodium falciparum collected in Thiès, Senegal between 2006 and 2013. The parasite samples were genotyped using a molecular barcode of 24 SNPs. About 35% of the samples grouped into subsets with identical barcodes, varying in size by year and sometimes persisting across years. The barcodes also formed networks of related groups. Analysis of 164 completely sequenced parasites revealed extensive sharing of genomic regions. In at least two cases we found first-generation recombinant offspring of parents whose genomes are similar or identical to genomes also present in the sample. An epidemiological model that tracks parasite genotypes can reproduce the observed pattern of barcode subsets. Quantification of likelihoods in the model strongly suggests a reduction of transmission from 2006-2010 with a significant rebound in 2012-2013. The reduced transmission and rebound were confirmed directly by incidence data from Thiès. These findings imply that intensive intervention to control malaria results in rapid and dramatic changes in parasite population genomics. The results also suggest that genomics combined with epidemiological modeling may afford prompt, continuous, and cost-effective tracking of progress toward malaria elimination.
Publication The Distribution of Pairwise Genetic Distances: A Tool for Investigating Disease Transmission
(Genetics Society of America, 2014) Worby, Colin; Chang, Hsiao-Han; Hanage, William; Lipsitch, MarcWhole-genome sequencing of pathogens has recently been used to investigate disease outbreaks and is likely to play a growing role in real-time epidemiological studies. Methods to analyze high-resolution genomic data in this context are still lacking, and inferring transmission dynamics from such data typically requires many assumptions. While recent studies have proposed methods to infer who infected whom based on genetic distance between isolates from different individuals, the link between epidemiological relationship and genetic distance is still not well understood. In this study, we investigated the distribution of pairwise genetic distances between samples taken from infected hosts during an outbreak. We proposed an analytically tractable approximation to this distribution, which provides a framework to evaluate the likelihood of particular transmission routes. Our method accounts for the transmission of a genetically diverse inoculum, a possibility overlooked in most analyses. We demonstrated that our approximation can provide a robust estimation of the posterior probability of transmission routes in an outbreak and may be used to rule out transmission events at a particular probability threshold. We applied our method to data collected during an outbreak of methicillin-resistant Staphylococcus aureus, ruling out several potential transmission links. Our study sheds light on the accumulation of mutations in a pathogen during an epidemic and provides tools to investigate transmission dynamics, avoiding the intensive computation necessary in many existing methods.
Publication Genomic variation and evolution of the human malaria parasite Plasmodium falciparum
(2013-10-08) Chang, Hsiao-Han; Hartl, Daniel L.; Wakeley, John; Hoekstra, Hopi; Neafsey, DanielMalaria is a deadly disease that causes nearly one million deaths each year. Understanding the demographic history of the malaria parasite Plasmodium falciparum and the genetic basis of its adaptations to antimalarial treatments and the human immune system is important for developing methods to control and eradicate malaria. To study the long-term demographic history and recent effective size of the population in order to identify genes under selection more efficiently and predict the effectiveness of selection, in Chapter 2 we sequenced the complete genomes of 25 cultured P. falciparum isolates from Senegal. In addition, in Chapter 3 we estimated temporal allele frequencies in 24 loci among 528 strains from the same population across six years. Based on genetic diversity of the genome sequences, we estimate the long-term effective population size to be approximately 100,000, and a major population expansion of the parasite population approximately 20,000-40,000 years ago. Based on temporal changes in allele frequencies, however, the recent effective size is estimated to be less than 100 from 2007-2011. The discrepancy may reflect recent aggressive efforts to control malaria in Senegal or migration between populations.
Publication Sequence-Based Association and Selection Scans Identify Drug Resistance Loci in the Plasmodium Falciparum Malaria Parasite
(Proceedings of the National Academy of Sciences, 2012) Park, Daniel John; Lukens, Amanda; Neafsey, Daniel; Schaffner, Stephen; Chang, Hsiao-Han; Valim, Clarissa; Ribacke, Ulf; Van tyne, Daria; Galinsky, Kevin; Galligan, Meghan; Becker, Justin S.; Ndiaye, Daouda; Mboup, Souleymane; Wiegand, Roger; Hartl, Daniel; Sabeti, Pardis; Wirth, Dyann; Volkman, SarahThrough rapid genetic adaptation and natural selection, the Plasmodium falciparum parasite—the deadliest of those that cause malaria—is able to develop resistance to antimalarial drugs, thwarting present efforts to control it. Genome-wide association studies (GWAS) provide a critical hypothesis-generating tool for understanding how this occurs. However, in P. falciparum, the limited amount of linkage disequilibrium hinders the power of traditional array-based GWAS. Here, we demonstrate the feasibility and power improvements gained by using whole-genome sequencing for association studies. We analyzed data from 45 Senegalese parasites and identified genetic changes associated with the parasites’ in vitro response to 12 different antimalarials. To further increase statistical power, we adapted a common test for natural selection, XP-EHH (cross-population extended haplotype homozygosity), and used it to identify genomic regions associated with resistance to drugs. Using this sequence-based approach and the combination of association and selection-based tests, we detected several loci associated with drug resistance. These loci included the previously known signals at pfcrt, dhfr, and pfmdr1, as well as many genes not previously implicated in drug-resistance roles, including genes in the ubiquitination pathway. Based on the success of the analysis presented in this study, and on the demonstrated shortcomings of array-based approaches, we argue for a complete transition to sequence-based GWAS for small, low linkage-disequilibrium genomes like that of P. falciparum.
Publication The Evolution and Functional Significance of Nested Gene Structures in Drosophila melanogaster
(Oxford University Press, 2013-10-01) Lee, Yuh Chwen G.; Chang, Hsiao-HanNearly ten percent of the genes in the genome of Drosophila melanogaster are in nested structures, in which one gene is completely nested within the intron of another gene (nested and including gene, respectively). Even though the coding sequences and UTRs of these nested/including gene pairs do not overlap, their intimate structures and the possibility of shared regulatory sequences raise questions about the evolutionary forces governing the origination, and subsequent functional and evolutionary impacts of these structures. In this study, we show that nested genes experience weaker evolutionary constraint, have faster rates of protein evolution and are expressed in fewer tissues than other genes, while including genes show the opposite patterns. Surprisingly, despite completely overlapping with each other, nested and including genes are less likely to display correlated gene expression and biological function than the nearby yet non-overlapping genes. Interestingly, significantly fewer nested genes are transcribed from the same strand as the including gene. We found that same-strand nested genes are more likely to be single-exon genes. In addition, same-strand including genes are less likely to have known lethal or sterile phenotypes than opposite-strand including genes only when the corresponding nested genes have introns. These results support our hypothesis that selection against potential erroneous mRNA splicing when nested and including genes are on the same strand plays an important role in the evolution of nested gene structures.
Publication Origin and Proliferation of Multiple-Drug Resistance in Bacterial Pathogens
(American Society for Microbiology, 2015) Chang, Hsiao-Han; Cohen, Ted; Grad, Yonatan; Hanage, William; O, Thomas F.; Lipsitch, MarcSUMMARY: Many studies report the high prevalence of multiply drug-resistant (MDR) strains. Because MDR infections are often significantly harder and more expensive to treat, they represent a growing public health threat. However, for different pathogens, different underlying mechanisms are traditionally used to explain these observations, and it is unclear whether each bacterial taxon has its own mechanism(s) for multidrug resistance or whether there are common mechanisms between distantly related pathogens. In this review, we provide a systematic overview of the causes of the excess of MDR infections and define testable predictions made by each hypothetical mechanism, including experimental, epidemiological, population genomic, and other tests of these hypotheses. Better understanding the cause(s) of the excess of MDR is the first step to rational design of more effective interventions to prevent the origin and/or proliferation of MDR.
Publication THE REAL McCOIL: A method for the concurrent estimation of the complexity of infection and SNP allele frequency for malaria parasites
(Public Library of Science, 2017) Chang, Hsiao-Han; Worby, Colin; Yeka, Adoke; Nankabirwa, Joaniter; Kamya, Moses R.; Staedke, Sarah G.; Dorsey, Grant; Murphy, Maxwell; Neafsey, Daniel E.; Jeffreys, Anna E.; Hubbart, Christina; Rockett, Kirk A.; Amato, Roberto; Kwiatkowski, Dominic P.; Buckee, Caroline; Greenhouse, BryanAs many malaria-endemic countries move towards elimination of Plasmodium falciparum, the most virulent human malaria parasite, effective tools for monitoring malaria epidemiology are urgent priorities. P. falciparum population genetic approaches offer promising tools for understanding transmission and spread of the disease, but a high prevalence of multi-clone or polygenomic infections can render estimation of even the most basic parameters, such as allele frequencies, challenging. A previous method, COIL, was developed to estimate complexity of infection (COI) from single nucleotide polymorphism (SNP) data, but relies on monogenomic infections to estimate allele frequencies or requires external allele frequency data which may not available. Estimates limited to monogenomic infections may not be representative, however, and when the average COI is high, they can be difficult or impossible to obtain. Therefore, we developed THE REAL McCOIL, Turning HEterozygous SNP data into Robust Estimates of ALelle frequency, via Markov chain Monte Carlo, and Complexity Of Infection using Likelihood, to incorporate polygenomic samples and simultaneously estimate allele frequency and COI. This approach was tested via simulations then applied to SNP data from cross-sectional surveys performed in three Ugandan sites with varying malaria transmission. We show that THE REAL McCOIL consistently outperforms COIL on simulated data, particularly when most infections are polygenomic. Using field data we show that, unlike with COIL, we can distinguish epidemiologically relevant differences in COI between and within these sites. Surprisingly, for example, we estimated high average COI in a peri-urban subregion with lower transmission intensity, suggesting that many of these cases were imported from surrounding regions with higher transmission intensity. THE REAL McCOIL therefore provides a robust tool for understanding the molecular epidemiology of malaria across transmission settings.
Publication Malaria Life Cycle Intensifies Both Natural Selection and Random Genetic Drift
(Proceedings of the National Academy of Sciences, 2013) Chang, Hsiao-Han; Moss, Eli L.; Park, Daniel John; Ndiaye, Daouda; Mboup, Souleymane; Volkman, Sarah; Sabeti, Pardis; Wirth, Dyann; Neafsey, Daniel; Hartl, DanielAnalysis of genome sequences of 159 isolates of Plasmodium falciparum from Senegal yields an extraordinarily high proportion (26.85%) of protein-coding genes with the ratio of nonsynonymous to synonymous polymorphism greater than one. This proportion is much greater than observed in other organisms. Also unusual is that the site-frequency spectra of synonymous and nonsynonymous polymorphisms are virtually indistinguishable. We hypothesized that the complicated life cycle of malaria parasites might lead to qualitatively different population genetics from that predicted from the classical Wright-Fisher (WF) model, which assumes a single random-mating population with a finite and constant population size in an organism with nonoverlapping generations. This paper summarizes simulation studies of random genetic drift and selection in malaria parasites that take into account their unusual life history. Our results show that random genetic drift in the malaria life cycle is more pronounced than under the WF model. Paradoxically, the efficiency of purifying selection in the malaria life cycle is also greater than under WF, and the relative efficiency of positive selection varies according to conditions. Additionally, the site-frequency spectrum under neutrality is also more skewed toward low-frequency alleles than expected with WF. These results highlight the importance of considering the malaria life cycle when applying existing population genetic tools based on the WF model. The same caveat applies to other species with similarly complex life cycles.
Publication Variation in infection length and superinfection enhance selection efficiency in the human malaria parasite
(Nature Publishing Group, 2016) Chang, Hsiao-Han; Childs, Lauren; Buckee, Caroline O.The capacity for adaptation is central to the evolutionary success of the human malaria parasite Plasmodium falciparum. Malaria epidemiology is characterized by the circulation of multiple, genetically diverse parasite clones, frequent superinfection, and highly variable infection lengths, a large number of which are chronic and asymptomatic. The impact of these characteristics on the evolution of the parasite is largely unknown, however, hampering our understanding of the impact of interventions and the emergence of drug resistance. In particular, standard population genetic frameworks do not accommodate variation in infection length or superinfection. Here, we develop a population genetic model of malaria including these variations, and show that these aspects of malaria infection dynamics enhance both the probability and speed of fixation for beneficial alleles in complex and non-intuitive ways. We find that populations containing a mixture of short- and long-lived infections promote selection efficiency. Interestingly, this increase in selection efficiency occurs even when only a small fraction of the infections are chronic, suggesting that selection can occur efficiently in areas of low transmission intensity, providing a hypothesis for the repeated emergence of drug resistance in the low transmission setting of Southeast Asia.
Publication Recurrent bottlenecks in the malaria life cycle obscure signals of positive selection
(Cambridge University Press (CUP), 2014) Chang, Hsiao-Han; Hartl, DanielDetecting signals of selection in the genome of malaria parasites is a key to identify targets for drug and vaccine development. Malaria parasites have a unique life cycle alternating between vector and host organism with a population bottleneck at each transition. These recurrent bottlenecks could influence the patterns of genetic diversity and the power of existing population genetic tools to identify sites under positive selection. We therefore simulated the site-frequency spectrum of a beneficial mutant allele through time under the malaria life cycle. We investigated the power of current population genetic methods to detect positive selection based on the site-frequency spectrum as well as temporal changes in allele frequency. We found that a within-host selective advantage is difficult to detect using these methods. Although a between-host transmission advantage could be detected, the power is decreased when compared with the classical Wright– Fisher (WF) population model. Using an adjusted null site-frequency spectrum that takes the malaria life cycle into account, the power of tests based on the site-frequency spectrum to detect positive selection is greatly improved. Our study demonstrates the importance of considering the life cycle in genetic analysis, especially in parasites with complex life cycles