Person:

MacArthur, Daniel

Loading...
Profile Picture

Email Address

AA Acceptance Date

Birth Date

Research Projects

Organizational Units

Job Title

Last Name

MacArthur

First Name

Daniel

Name

MacArthur, Daniel

Search Results

Now showing 1 - 9 of 9
  • Publication

    High-throughput discovery of novel developmental phenotypes

    (2016) Dickinson, Mary E.; Flenniken, Ann M.; Ji, Xiao; Teboul, Lydia; Wong, Michael D.; White, Jacqueline K.; Meehan, Terrence F.; Weninger, Wolfgang J.; Westerberg, Henrik; Adissu, Hibret; Baker, Candice N.; Bower, Lynette; Brown, James M.; Caddle, L. Brianna; Chiani, Francesco; Clary, Dave; Cleak, James; Daly, Mark; Denegre, James M.; Doe, Brendan; Dolan, Mary E.; Edie, Sarah M.; Fuchs, Helmut; Gailus-Durner, Valerie; Galli, Antonella; Gambadoro, Alessia; Gallegos, Juan; Guo, Shiying; Horner, Neil R.; Hsu, Chih-wei; Johnson, Sara J.; Kalaga, Sowmya; Keith, Lance C.; Lanoue, Louise; Lawson, Thomas N.; Lek, Monkol; Mark, Manuel; Marschall, Susan; Mason, Jeremy; McElwee, Melissa L.; Newbigging, Susan; Nutter, Lauryl M.J.; Peterson, Kevin A.; Ramirez-Solis, Ramiro; Rowland, Douglas J.; Ryder, Edward; Samocha, Kaitlin E.; Seavitt, John R.; Selloum, Mohammed; Szoke-Kovacs, Zsombor; Tamura, Masaru; Trainor, Amanda G; Tudose, Ilinca; Wakana, Shigeharu; Warren, Jonathan; Wendling, Olivia; West, David B.; Wong, Leeyean; Yoshiki, Atsushi; MacArthur, Daniel; Tocchini-Valentini, Glauco P.; Gao, Xiang; Flicek, Paul; Bradley, Allan; Skarnes, William C.; Justice, Monica J.; Parkinson, Helen E.; Moore, Mark; Wells, Sara; Braun, Robert E.; Svenson, Karen L.; de Angelis, Martin Hrabe; Herault, Yann; Mohun, Tim; Mallon, Ann-Marie; Henkelman, R. Mark; Brown, Steve D.M.; Adams, David J.; Lloyd, K.C. Kent; McKerlie, Colin; Beaudet, Arthur L.; Bucan, Maja; Murray, Stephen A.

    Approximately one third of all mammalian genes are essential for life. Phenotypes resulting from mouse knockouts of these genes have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5000 knockout mouse lines, we have identified 410 lethal genes during the production of the first 1751 unique gene knockouts. Using a standardised phenotyping platform that incorporates high-resolution 3D imaging, we identified novel phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes identified in our screen, thus providing a novel dataset that facilitates prioritization and validation of mutations identified in clinical sequencing efforts.

  • Publication

    Quantifying unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects

    (Nature Publishing Group, 2016) Zou, James; Valiant, Gregory; Valiant, Paul; Karczewski, Konrad; Chan, Siu On; Samocha, Kaitlin E.; Lek, Monkol; Sunyaev, Shamil; Daly, Mark; MacArthur, Daniel

    As new proposals aim to sequence ever larger collection of humans, it is critical to have a quantitative framework to evaluate the statistical power of these projects. We developed a new algorithm, UnseenEst, and applied it to the exomes of 60,706 individuals to estimate the frequency distribution of all protein-coding variants, including rare variants that have not been observed yet in the current cohorts. Our results quantified the number of new variants that we expect to identify as sequencing cohorts reach hundreds of thousands of individuals. With 500K individuals, we find that we expect to capture 7.5% of all possible loss-of-function variants and 12% of all possible missense variants. We also estimate that 2,900 genes have loss-of-function frequency of <0.00001 in healthy humans, consistent with very strong intolerance to gene inactivation.

  • Publication

    The ExAC browser: displaying reference data information from over 60 000 exomes

    (Oxford University Press, 2017) Karczewski, Konrad; Weisburd, Ben; Thomas, Brett; Solomonson, Matthew; Ruderfer, Douglas M.; Kavanagh, David; Hamamsy, Tymor; Lek, Monkol; Samocha, Kaitlin E.; Cummings, Beryl; Birnbaum, Daniel; Daly, Mark; MacArthur, Daniel

    Worldwide, hundreds of thousands of humans have had their genomes or exomes sequenced, and access to the resulting data sets can provide valuable information for variant interpretation and understanding gene function. Here, we present a lightweight, flexible browser framework to display large population datasets of genetic variation. We demonstrate its use for exome sequence data from 60 706 individuals in the Exome Aggregation Consortium (ExAC). The ExAC browser provides gene- and transcript-centric displays of variation, a critical view for clinical applications. Additionally, we provide a variant display, which includes population frequency and functional annotation data as well as short read support for the called variant. This browser is open-source, freely available at http://exac.broadinstitute.org, and has already been used extensively by clinical laboratories worldwide.

  • Publication

    Patterns of genic intolerance of rare copy number variation in 59,898 human exomes

    (2016) Ruderfer, Douglas M.; Hamamsy, Tymor; Lek, Monkol; Karczewski, Konrad; Kavanagh, David; Samocha, Kaitlin E.; Daly, Mark; MacArthur, Daniel; Fromer, Menachem; Purcell, Shaun M.

    Copy number variation (CNV) impacting protein-coding genes contributes significantly to human diversity and disease. Here we characterized the rates and properties of rare genic CNV (<0.5% frequency) in exome-sequencing data from nearly 60,000 individuals in the Exome Aggregation Consortium (ExAC). On average, individuals possessed 0.81 deleted and 1.75 duplicated genes, and most (70%) carried at least one rare genic CNV. For every gene, we empirically estimated an index of relative intolerance to CNVs that demonstrated moderate correlation with measures of genic constraint based on single-nucleotide variation (SNV) and was independently correlated with measures of evolutionary conservation. For individuals with schizophrenia, genes impacted by CNVs were more intolerant than in controls. ExAC CNV data constitutes a critical component of an integrated database spanning the spectrum of human genetic variation, aiding the interpretation of personal genomes as well as population-based disease studies. These data are freely available for download and visualization online.

  • Publication

    Analysis of protein-coding genetic variation in 60,706 humans

    (2016) Lek, Monkol; Karczewski, Konrad; Minikel, Eric; Samocha, Kaitlin E.; Banks, Eric; Fennell, Timothy; O'Donnell-Luria, Anne H; Ware, James S; Hill, Andrew J; Cummings, Beryl; Tukiainen, Taru; Birnbaum, Daniel P; Kosmicki, Jack; Duncan, Laramie E; Estrada, Karol; Zhao, Fengmei; Zou, James; Pierce-Hoffman, Emma; Berghout, Joanne; Cooper, David N; Deflaux, Nicole; DePristo, Mark; Do, Ron; Flannick, Jason; Fromer, Menachem; Gauthier, Laura; Goldstein, Jackie; Gupta, Namrata; Howrigan, Daniel; Kiezun, Adam; Kurki, Mitja; Moonshine, Ami Levy; Natarajan, Pradeep; Orozco, Lorena; Peloso, Gina M; Poplin, Ryan; Rivas, Manuel A; Ruano-Rubio, Valentin; Rose, Samuel A; Ruderfer, Douglas M; Shakir, Khalid; Stenson, Peter D; Stevens, Christine; Thomas, Brett P; Tiao, Grace; Tusie-Luna, Maria T; Weisburd, Ben; Won, Hong-Hee; Yu, Dongmei; Altshuler, David; Ardissino, Diego; Boehnke, Michael; Danesh, John; Donnelly, Stacey; Elosua, Roberto; Florez, Jose; Gabriel, Stacey B; Getz, Gad; Glatt, Stephen J; Hultman, Christina M; Kathiresan, Sekar; Laakso, Markku; McCarroll, Steven; McCarthy, Mark I; McGovern, Dermot; McPherson, Ruth; Neale, Benjamin; Palotie, Aarno; Purcell, Shaun M; Saleheen, Danish; Scharf, Jeremiah; Sklar, Pamela; Sullivan, Patrick F; Tuomilehto, Jaakko; Tsuang, Ming T; Watkins, Hugh C; Wilson, James G; Daly, Mark; MacArthur, Daniel

    Summary Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. We describe the aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of truncating variants with 72% having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human “knockout” variants in protein-coding genes.

  • Publication

    A framework for the interpretation of de novo mutation in human disease

    (2014) Samocha, Kaitlin E.; Robinson, Elise; Sanders, Stephan J.; Stevens, Christine; Sabo, Aniko; McGrath, Lauren M.; Kosmicki, Jack; Rehnström, Karola; Mallick, Swapan; Kirby, Andrew; Wall, Dennis P.; MacArthur, Daniel; Gabriel, Stacey B.; dePristo, Mark; Purcell, Shaun M.; Palotie, Aarno; Boerwinkle, Eric; Buxbaum, Joseph D.; Cook, Edwin H.; Gibbs, Richard A.; Schellenberg, Gerard D.; Sutcliffe, James S.; Devlin, Bernie; Roeder, Kathryn; Neale, Benjamin; Daly, Mark

    Spontaneously arising (‘de novo’) mutations play an important role in medical genetics. For diseases with extensive locus heterogeneity – such as autism spectrum disorders (ASDs) – the signal from de novo mutations (DNMs) is distributed across many genes, making it difficult to distinguish disease-relevant mutations from background variation. We provide a statistical framework for the analysis of DNM excesses per gene and gene set by calibrating a model of de novo mutation. We applied this framework to DNMs collected from 1,078 ASD trios and – while affirming a significant role for loss-of-function (LoF) mutations – found no excess of de novo LoF mutations in cases with IQ above 100, suggesting that the role of DNMs in ASD may reside in fundamental neurodevelopmental processes. We also used our model to identify ~1,000 genes that are significantly lacking functional coding variation in non-ASD samples and are enriched for de novo LoF mutations identified in ASD cases.

  • Publication

    Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity

    (2017) Saleheen, Danish; Natarajan, Pradeep; Armean, Irina; Zhao, Wei; Rasheed, Asif; Khetarpal, Sumeet; Won, Hong-Hee; Karczewski, Konrad; O’Donnell-Luria, Anne H.; Samocha, Kaitlin E.; Weisburd, Benjamin; Gupta, Namrata; Zaidi, Mozzam; Samuel, Maria; Imran, Atif; Abbas, Shahid; Majeed, Faisal; Ishaq, Madiha; Akhtar, Saba; Trindade, Kevin; Mucksavage, Megan; Qamar, Nadeem; Zaman, Khan Shah; Yaqoob, Zia; Saghir, Tahir; Rizvi, Syed Nadeem Hasan; Memon, Anis; Mallick, Nadeem Hayyat; Ishaq, Mohammad; Rasheed, Syed Zahed; Memon, Fazal-ur-Rehman; Mahmood, Khalid; Ahmed, Naveeduddin; Do, Ron; Krauss, Ronald M.; MacArthur, Daniel; Gabriel, Stacey; Lander, Eric; Daly, Mark; Frossard, Philippe; Danesh, John; Rader, Daniel J.; Kathiresan, Sekar

    A major goal of biomedicine is to understand the function of every gene in the human genome.1 Loss-of-function (LoF) mutations can disrupt both copies of a given gene in humans and phenotypic analysis of such ‘human knockouts’ can provide insight into gene function. Consanguineous unions are more likely to result in offspring who carry LoF mutations in a homozygous state. In Pakistan, consanguinity rates are notably high.2 Here, we sequenced the protein-coding regions of 10,503 adult participants in the Pakistan Risk of Myocardial Infarction Study (PROMIS) designed to understand the determinants of cardiometabolic diseases in South Asians.3 We identified individuals carrying predicted LoF (pLoF) mutations in the homozygous state, and performed phenotypic analysis involving >200 biochemical and disease traits. We enumerated 49,138 rare (<1 % minor allele frequency) pLoF mutations. These pLoF mutations are predicted to knock out 1,317 genes in at least one participant. Homozygosity for pLoF mutations at PLAG27 was associated with absent enzymatic activity of soluble lipoprotein-associated phospholipase A2; at CYP2F1, with higher plasma interleukin-8 concentrations; at TREH, with lower concentrations of apoB-containing lipoprotein subfractions; at either A3GALT2 or NRG4, with markedly reduced plasma insulin C-peptide concentrations; and at SLC9A3R1, with mediators of calcium and phosphate signaling. Finally, APOC3 is a gene which retards clearance of plasma triglyceride-rich lipoproteins and where heterozygous deficiency confers protection against coronary heart disease.4,5 In Pakistan, we now observe APOC3 homozygous pLoF carriers; we recalled these knockout humans and challenged with an oral fat load. Compared with wild-type family members, APOC3 knockouts displayed marked blunting of the usual post-prandial rise in plasma triglycerides. Overall, these observations provide a roadmap for a ‘human knockout project’, a systematic effort to understand the phenotypic consequences of complete disruption of genes in humans.

  • Publication

    Estimating the Selective Effects of Heterozygous Protein Truncating Variants from Human Exome Data

    (2017) Cassa, Christopher; Weghorn, Donate; Balick, Daniel; Jordan, Daniel M.; Nusinow, David; Samocha, Kaitlin E.; O’Donnell-Luria, Anne; MacArthur, Daniel; Daly, Mark; Beier, David R.; Sunyaev, Shamil

    The dispensability of individual genes for viability has interested generations of geneticists. For some genes it is essential to maintain two functional chromosomal copies, while others may tolerate the loss of one or both copies. Exome sequence data from 60,706 individuals provide sufficient observations of rare protein truncating variants (PTVs) to make genome-wide estimates of selection against heterozygous loss of gene function. The cumulative frequency of rare deleterious PTVs is primarily determined by the balance between incoming mutations and purifying selection rather than genetic drift. This enables the estimation of the genome-wide distribution of selection coefficients for heterozygous PTVs and corresponding Bayesian estimates for individual genes. The strength of selection can discriminate the severity, age of onset, and mode of inheritance in Mendelian exome sequencing cases. We find that genes under the strongest selection are enriched in embryonic lethal mouse knockouts, putatively cell-essential genes, Mendelian disease genes, and regulators of transcription. Screening by essentiality, we find a large set of genes under strong selection that likely have critical function but have not yet been extensively annotated in published literature.

  • Publication

    Refining the role of de novo protein truncating variants in neurodevelopmental disorders using population reference samples

    (2017) Kosmicki, Jack; Samocha, Kaitlin E.; Howrigan, Daniel; Sanders, Stephan J.; Slowikowski, Kamil; Lek, Monkol; Karczewski, Konrad; Cutler, David J.; Devlin, Bernie; Roeder, Kathryn; Buxbaum, Joseph D.; Neale, Benjamin; MacArthur, Daniel; Wall, Dennis P.; Robinson, Elise; Daly, Mark

    Recent research has uncovered a significant role for de novo variation in neurodevelopmental disorders. Using aggregated data from 9246 families with autism spectrum disorder, intellectual disability, or developmental delay, we show ~1/3 of de novo variants are independently observed as standing variation in the Exome Aggregation Consortium’s cohort of 60,706 adults, and these de novo variants do not contribute to neurodevelopmental risk. We further use a loss-of-function (LoF)-intolerance metric, pLI, to identify a subset of LoF-intolerant genes that contain the observed signal of associated de novo protein truncating variants (PTVs) in neurodevelopmental disorders. LoF-intolerant genes also carry a modest excess of inherited PTVs; though the strongest de novo impacted genes contribute little to this, suggesting the excess of inherited risk resides lower-penetrant genes. These findings illustrate the importance of population-based reference cohorts for the interpretation of candidate pathogenic variants, even for analyses of complex diseases and de novo variation.