Person: de Bakker, Paul
Loading...
Email Address
AA Acceptance Date
Birth Date
Research Projects
Organizational Units
Job Title
Last Name
de Bakker
First Name
Paul
Name
de Bakker, Paul
15 results
Search Results
Now showing 1 - 10 of 15
Publication Genome-wide association study identifies multiple susceptibility loci for diffuse large B cell lymphoma(Nature Publishing Group, 2014) Cerhan, James R.; Berndt, Sonja I.; Vijai, Joseph; Ghesquières, Hervé; McKay, James; Wang, Sophia S.; Wang, Zhaoming; Yeager, Meredith; Conde, Lucia; de Bakker, Paul; Nieters, Alexandra; Cox, David; Burdett, Laurie; Monnereau, Alain; Flowers, Christopher R.; De Roos, Anneclaire J.; Brooks-Wilson, Angela R.; Lan, Qing; Severi, Gianluca; Melbye, Mads; Gu, Jian; Jackson, Rebecca D.; Kane, Eleanor; Teras, Lauren R.; Purdue, Mark P.; Vajdic, Claire M.; Spinelli, John J.; Giles, Graham G.; Albanes, Demetrius; Kelly, Rachel; Zucca, Mariagrazia; Bertrand, Kimberly; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Hutchinson, Amy; Zhi, Degui; Habermann, Thomas M.; Link, Brian K.; Novak, Anne J.; Dogan, Ahmet; Asmann, Yan W.; Liebow, Mark; Thompson, Carrie A.; Ansell, Stephen M.; Witzig, Thomas E.; Weiner, George J.; Veron, Amelie S.; Zelenika, Diana; Tilly, Hervé; Haioun, Corinne; Molina, Thierry Jo; Hjalgrim, Henrik; Glimelius, Bengt; Adami, Hans-Olov; Bracci, Paige M.; Riby, Jacques; Smith, Martyn T.; Holly, Elizabeth A.; Cozen, Wendy; Hartge, Patricia; Morton, Lindsay M.; Severson, Richard K.; Tinker, Lesley F.; North, Kari E.; Becker, Nikolaus; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; Staines, Anthony; Lightfoot, Tracy; Crouch, Simon; Smith, Alex; Roman, Eve; Diver, W. Ryan; Offit, Kenneth; Zelenetz, Andrew; Klein, Robert J.; Villano, Danylo J.; Zheng, Tongzhang; Zhang, Yawei; Holford, Theodore R.; Kricker, Anne; Turner, Jenny; Southey, Melissa C.; Clavel, Jacqueline; Virtamo, Jarmo; Weinstein, Stephanie; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Trichopoulos, Dimitrios; Vermeulen, Roel C. H.; Boeing, Heiner; Tjonneland, Anne; Angelucci, Emanuele; Di Lollo, Simonetta; Rais, Marco; Birmann, Brenda; Laden, Francine; Giovannucci, Edward; Kraft, Phillip; Huang, Jinyan; Ma, Baoshan; Ye, Yuanqing; Chiu, Brian C. H.; Sampson, Joshua; Liang, Liming; Park, Ju-Hyun; Chung, Charles C.; Weisenburger, Dennis D.; Chatterjee, Nilanjan; Fraumeni Jr, Joseph F.; Slager, Susan L.; Wu, Xifeng; de Sanjose, Silvia; Smedby, Karin E.; Salles, Gilles; Skibola, Christine F.; Rothman, Nathaniel; Chanock, Stephen J.Diffuse large B-cell lymphoma (DLBCL) is the most common lymphoma subtype and is clinically aggressive. To identify genetic susceptibility loci for DLBCL, we conducted a meta-analysis of three new genome-wide association studies (GWAS) and one prior scan, totaling 3,857 cases and 7,666 controls of European ancestry, with additional genotyping of nine promising SNPs in 1,359 cases and 4,557 controls. In our multi-stage analysis, five independent SNPs in four loci achieved genome-wide significance marked by rs116446171 at 6p25.3 (EXOC2; \(P=2.33x10^{-21}\)), rs2523607 at 6p21.33 (HLA-B; \(2.40x10^{-10}\)), rs79480871 at 2p23.3 (NCOA1; \(P=4.23x10^{-8}\)), and two independent SNPs, rs13255292 and rs4733601, at 8q24.21 (PVT1; \(P=9.98x10^{-13}\) and \(P=3.63x10^{-11}\), respectively). These data provide substantial new evidence for genetic susceptibility to this B-cell malignancy, and point towards pathways involved in immune recognition and immune function in the pathogenesis of DLBCL.Publication Negative selection in humans and fruit flies involves synergistic epistasis(Cold Spring Harbor Laboratory, 2016-07-29) Sohail, Mashaal; Vakhrusheva, Olga A; Sul, Jae; Pulit, Sara; Francioli, Laurent; van den Berg, Leonard H; Veldink, Jan H; de Bakker, Paul; Bazykin, Georgii A; Kondrashov, Alexey S; Sunyaev, ShamilAbstractNegative selection against deleterious alleles produced by mutation is the most common form of natural selection, which strongly influences within-population variation and interspecific divergence. However, some fundamental properties of negative selection remain obscure. In particular, it is still not known whether deleterious alleles affect fitness independently, so that cumulative fitness loss depends exponentially on the number of deleterious alleles, or synergistically, so that each additional deleterious allele results in a larger decrease in relative fitness. Negative selection with synergistic epistasis must produce negative linkage disequilibrium between deleterious alleles, and therefore, underdispersed distribution of the number of deleterious alleles in the genome. Indeed, we detected underdispersion of the number of rare loss-of-function (LoF) alleles in eight independent datasets from modern human andDrosophila melanogasterpopulations. Thus, ongoing selection against deleterious alleles is characterized by synergistic epistasis, which can explain how human and fly populations persist despite very high genomic deleterious mutation rates.Publication Amino Acid Position 11 of HLA-DRβ1 is a Major Determinant of Chromosome 6p Association with Ulcerative Colitis(2012) Achkar, Jean-Paul; Klei, Lambertus; de Bakker, Paul; Bellone, Gaia; Rebert, Nancy; Scott, Regan; Lu, Ying; Regueiro, Miguel; Brzezinski, Aaron; Kamboh, M. Ilyas; Fiocchi, Claudio; Devlin, Bernie; Trucco, Massimo; Ringquist, Steven; Roeder, Kathryn; Duerr, Richard HThe major histocompatibility complex (MHC) on chromosome 6p is an established risk locus for ulcerative colitis (UC) and Crohn’s disease (CD). We aimed to better define MHC association signals in UC and CD by combining data from dense single nucleotide polymorphism (SNP) genotyping and from imputation of classical HLA types, their constituent SNPs and corresponding amino acids in 562 UC, 611 CD, and 1,428 control subjects. Univariate and multivariate association analyses were performed, controlling for ancestry. In univariate analyses, absence of the rs9269955 C allele was strongly associated with risk for UC (P = 2.67×\(10^{-13}\)). rs9269955 is a SNP in the codon for amino acid position 11 of HLA-DRβ1, located in the P6 pocket of the HLA-DR antigen binding cleft. This amino acid position was also the most significantly UC-associated amino acid in omnibus tests (P = 2.68×\(10^{-13}\)). Multivariate modeling identified rs9269955-C and 13 other variants in best predicting UC versus control status. In contrast, there was only suggestive association evidence between the MHC and CD. Taken together, these data demonstrate that variation at HLA-DRβ1, amino acid 11 in the P6 pocket of the HLA-DR complex antigen binding cleft is a major determinant of chromosome 6p association with ulcerative colitis.Publication Common Variants in 40 Genes Assessed for Diabetes Incidence and Response to Metformin and Lifestyle Intervention in the Diabetes Prevention Program(American Diabetes Association, 2010) Jablonski, Kathleen A.; McAteer, Jarred B.; Franks, Paul W.; Pollin, Toni I.; Hanson, Robert L.; Fowler, Sarah; Shuldiner, Alan R.; Knowler, William C.; de Bakker, Paul; Saxena, Richa; Altshuler, David; Florez, JoseOBJECTIVE: Genome-wide association studies have begun to elucidate the genetic architecture of type 2 diabetes. We examined whether single nucleotide polymorphisms (SNPs) identified through targeted complementary approaches affect diabetes incidence in the at-risk population of the Diabetes Prevention Program (DPP) and whether they influence a response to preventive interventions. RESEARCH DESIGN AND METHODS: We selected SNPs identified by prior genome-wide association studies for type 2 diabetes and related traits, or capturing common variation in 40 candidate genes previously associated with type 2 diabetes, implicated in monogenic diabetes, encoding type 2 diabetes drug targets or drug-metabolizing/transporting enzymes, or involved in relevant physiological processes. We analyzed 1,590 SNPs for association with incident diabetes and their interaction with response to metformin or lifestyle interventions in 2,994 DPP participants. We controlled for multiple hypothesis testing by assessing false discovery rates. RESULTS: We replicated the association of variants in the metformin transporter gene \(SLC47A1\) with metformin response and detected nominal interactions in the AMP kinase (AMPK) gene \(STK11\), the AMPK subunit genes \(PRKAA1\) and \(PRKAA2\), and a missense SNP in \(SLC22A1\), which encodes another metformin transporter. The most significant association with diabetes incidence occurred in the AMPK subunit gene \(PRKAG2\) (hazard ratio 1.24, 95% CI 1.09–1.40, P = 7 × 10\(^{−4}\)). Overall, there were nominal associations with diabetes incidence at 85 SNPs and nominal interactions with the metformin and lifestyle interventions at 91 and 69 mostly nonoverlapping SNPs, respectively. The lowest \(P\) values were consistent with experiment-wide 33% false discovery rates. CONCLUSIONS: We have identified potential genetic determinants of metformin response. These results merit confirmation in independent samples.Publication GWAS Identifies Novel Susceptibility Loci on 6p21.32 and 21q21.3 for Hepatocellular Carcinoma in Chronic Hepatitis B Virus Carriers(Public Library of Science, 2012) Li, Shengping; Qian, Ji; Zhao, Wanting; Dai, Juncheng; Bei, Jin-Xin; Foo, Jia Nee; McLaren, Paul J.; Li, Zhiqiang; Yang, Jingmin; Shen, Feng; Yang, Jiamei; Li, Shuhong; Pan, Shandong; Li, Wenjin; Zhai, Xiangjun; Zhou, Boping; Shi, Lehua; Chen, Xinchun; Chu, Minjie; Yan, Yiqun; Cheng, Shuqun; Shen, Jiawei; Jia, Weihua; Liu, Jibin; Yang, Jiahe; Wen, Zujia; Li, Aijun; Zhang, Guoliang; Luo, Xianrong; Qin, Hongbo; Chen, Minshan; Lin, Dongxin; Shen, Hongbing; Wang, Hongyang; Zeng, Yi-Xin; Wu, Mengchao; Hu, Zhibin; Shi, Yongyong; Liu, Jianjun; Zhou, Weiping; Yang, Yuan; Liu, Li; Wang, Yi; Wang, Jun; Zhang, Ying; Wang, Hua; Jin, Li; He, Lin; de Bakker, PaulGenome-wide association studies (GWAS) have recently identified KIF1B as susceptibility locus for hepatitis B virus (HBV)–related hepatocellular carcinoma (HCC). To further identify novel susceptibility loci associated with HBV–related HCC and replicate the previously reported association, we performed a large three-stage GWAS in the Han Chinese population. 523,663 autosomal SNPs in 1,538 HBV–positive HCC patients and 1,465 chronic HBV carriers were genotyped for the discovery stage. Top candidate SNPs were genotyped in the initial validation samples of 2,112 HBV–positive HCC cases and 2,208 HBV carriers and then in the second validation samples of 1,021 cases and 1,491 HBV carriers. We discovered two novel associations at rs9272105 (HLA-DQA1/DRB1) on 6p21.32 (OR = 1.30, P = 1.13×\(10^{−19}\)) and rs455804 (GRIK1) on 21q21.3 (OR = 0.84, P = 1.86×\(10^{−8}\)), which were further replicated in the fourth independent sample of 1,298 cases and 1,026 controls (rs9272105: OR = 1.25, P = 1.71×\(10^{−4}\); rs455804: OR = 0.84, P = 6.92×\(10^{−3}\)). We also revealed the associations of HLA-DRB1*0405 and 0901*0602, which could partially account for the association at rs9272105. The association at rs455804 implicates GRIK1 as a novel susceptibility gene for HBV–related HCC, suggesting the involvement of glutamate signaling in the development of HBV–related HCC.Publication Novel Loci for Metabolic Networks and Multi-Tissue Expression Studies Reveal Genes for Atherosclerosis(Public Library of Science, 2012) Inouye, Michael; Ripatti, Samuli; Kettunen, Johannes; Lyytikäinen, Leo-Pekka; Oksala, Niku; Laurila, Pirkka-Pekka; Kangas, Antti J.; Soininen, Pasi; Savolainen, Markku J.; Viikari, Jorma; Kähönen, Mika; Perola, Markus; Salomaa, Veikko; Raitakari, Olli; Lehtimäki, Terho; Taskinen, Marja-Riitta; Järvelin, Marjo-Riitta; Ala-Korpela, Mika; Palotie, Aarno; de Bakker, PaulAssociation testing of multiple correlated phenotypes offers better power than univariate analysis of single traits. We analyzed 6,600 individuals from two population-based cohorts with both genome-wide SNP data and serum metabolomic profiles. From the observed correlation structure of 130 metabolites measured by nuclear magnetic resonance, we identified 11 metabolic networks and performed a multivariate genome-wide association analysis. We identified 34 genomic loci at genome-wide significance, of which 7 are novel. In comparison to univariate tests, multivariate association analysis identified nearly twice as many significant associations in total. Multi-tissue gene expression studies identified variants in our top loci, SERPINA1 and AQP9, as eQTLs and showed that SERPINA1 and AQP9 expression in human blood was associated with metabolites from their corresponding metabolic networks. Finally, liver expression of AQP9 was associated with atherosclerotic lesion area in mice, and in human arterial tissue both SERPINA1 and AQP9 were shown to be upregulated (6.3-fold and 4.6-fold, respectively) in atherosclerotic plaques. Our study illustrates the power of multi-phenotype GWAS and highlights candidate genes for atherosclerosis.Publication Deleterious Alleles in the Human Genome Are on Average Younger Than Neutral Alleles of the Same Frequency(Public Library of Science, 2013) Kiezun, Adam; Pulit, Sara L.; Francioli, Laurent C.; van Dijk, Freerk; Swertz, Morris; Boomsma, Dorret I.; van Duijn, Cornelia M.; Slagboom, P. Eline; van Ommen, G. J. B.; Wijmenga, Cisca; de Bakker, Paul; Sunyaev, ShamilLarge-scale population sequencing studies provide a complete picture of human genetic variation within the studied populations. A key challenge is to identify, among the myriad alleles, those variants that have an effect on molecular function, phenotypes, and reproductive fitness. Most non-neutral variation consists of deleterious alleles segregating at low population frequency due to incessant mutation. To date, studies characterizing selection against deleterious alleles have been based on allele frequency (testing for a relative excess of rare alleles) or ratio of polymorphism to divergence (testing for a relative increase in the number of polymorphic alleles). Here, starting from Maruyama's theoretical prediction (Maruyama T (1974), Am J Hum Genet USA 6:669–673) that a (slightly) deleterious allele is, on average, younger than a neutral allele segregating at the same frequency, we devised an approach to characterize selection based on allelic age. Unlike existing methods, it compares sets of neutral and deleterious sequence variants at the same allele frequency. When applied to human sequence data from the Genome of the Netherlands Project, our approach distinguishes low-frequency coding non-synonymous variants from synonymous and non-coding variants at the same allele frequency and discriminates between sets of variants independently predicted to be benign or damaging for protein structure and function. The results confirm the abundance of slightly deleterious coding variation in humans.Publication Common Genetic Variation Near the Phospholamban Gene Is Associated with Cardiac Repolarisation: Meta-Analysis of Three Genome-Wide Association Studies(Public Library of Science, 2009) Nolte, Ilja M.; Wallace, Chris; Newhouse, Stephen J.; Waggott, Daryl; Fu, Jingyuan; Soranzo, Nicole; Gwilliam, Rhian; Deloukas, Panos; Savelieva, Irina; Zheng, Dongling; Dalageorgou, Chrysoula; Farrall, Martin; Samani, Nilesh J.; Brown, Morris; Dominiczak, Anna; Lathrop, Mark; Zeggini, Eleftheria; Wain, Louise V.; Eijgelsheim, Mark; Pfeufer, Arne; Sanna, Serena; Arking, Dan E.; Asselbergs, Folkert W.; Spector, Tim D.; Carter, Nicholas D.; Jeffery, Steve; Tobin, Martin; Caulfield, Mark; Snieder, Harold; Munroe, Patricia B.; Jamshidi, Yalda; Connell, John; Newton-Cheh, Christopher; Rice, Ken; de Bakker, Paul; Paterson, Andrew D.To identify loci affecting the electrocardiographic QT interval, a measure of cardiac repolarisation associated with risk of ventricular arrhythmias and sudden cardiac death, we conducted a meta-analysis of three genome-wide association studies (GWAS) including 3,558 subjects from the TwinsUK and BRIGHT cohorts in the UK and the DCCT/EDIC cohort from North America. Five loci were significantly associated with QT interval at P<1×10\(^{−6}\). To validate these findings we performed an in silico comparison with data from two QT consortia: QTSCD (n = 15,842) and QTGEN (n = 13,685). Analysis confirmed the association between common variants near NOS1AP (P = 1.4×10\(^{−83}\)) and the phospholamban (PLN) gene (P = 1.9×10\(^{−29}\)). The most associated SNP near NOS1AP (rs12143842) explains 0.82% variance; the SNP near PLN (rs11153730) explains 0.74% variance of QT interval duration. We found no evidence for interaction between these two SNPs (P = 0.99). PLN is a key regulator of cardiac diastolic function and is involved in regulating intracellular calcium cycling, it has only recently been identified as a susceptibility locus for QT interval. These data offer further mechanistic insights into genetic influence on the QT interval which may predispose to life threatening arrhythmias and sudden cardiac death.Publication Effective Detection of Human Leukocyte Antigen Risk Alleles in Celiac Disease Using Tag Single Nucleotide Polymorphisms(Public Library of Science, 2008) Monsuur, Alienke J.; Zhernakova, Alexandra; Pinto, Dalila; Verduijn, Willem; Romanos, Jihane; Auricchio, Renata; Lopez, Ana; van Heel, David A.; Crusius, J. Bart A; Wijmenga, Cisca; de Bakker, PaulBackground: The HLA genes, located in the MHC region on chromosome 6p21.3, play an important role in many autoimmune disorders, such as celiac disease (CD), type 1 diabetes (T1D), rheumatoid arthritis, multiple sclerosis, psoriasis and others. Known HLA variants that confer risk to CD, for example, include DQA1*05/DQB1*02 (DQ2.5) and DQA1*03/DQB1*0302 (DQ8). To diagnose the majority of CD patients and to study disease susceptibility and progression, typing these strongly associated HLA risk factors is of utmost importance. However, current genotyping methods for HLA risk factors involve many reactions, and are complicated and expensive. We sought a simple experimental approach using tagging SNPs that predict the CD-associated HLA risk factors. Methodology: Our tagging approach exploits linkage disequilibrium between single nucleotide polymorphism (SNPs) and the CD-associated HLA risk factors DQ2.5 and DQ8 that indicate direct risk, and DQA1*0201/DQB1*0202 (DQ2.2) and DQA1*0505/DQB1*0301 (DQ7) that attribute to the risk of DQ2.5 to CD. To evaluate the predictive power of this approach, we performed an empirical comparison of the predicted DQ types, based on these six tag SNPs, with those executed with current validated laboratory typing methods of the HLA-DQA1 and -DQB1 genes in three large cohorts. The results were validated in three European celiac populations. Conclusion: Using this method, only six SNPs were needed to predict the risk types carried by >95% of CD patients. We determined that for this tagging approach the sensitivity was >0.991, specificity >0.996 and the predictive value >0.948. Our results show that this tag SNP method is very accurate and provides an excellent basis for population screening for CD. This method is broadly applicable in European populations.Publication Common Missense Variant in the Glucokinase Regulatory Protein Gene Is Associated With Increased Plasma Triglyceride and C-Reactive Protein but Lower Fasting Glucose Concentrations(American Diabetes Association, 2008) Orho-Melander, Marju; Melander, Olle; Guiducci, Candace; Perez-Martinez, Pablo; Corella, Dolores; Roos, Charlotta; Tewhey, Ryan; Rieder, Mark J.; Hall, Jennifer; Abecasis, Goncalo; Tai, E. Shyong; Welch, Cullan; Arnett, Donna K.; Lyssenko, Valeriya; Lindholm, Eero; Burtt, Noel; Voight, Benjamin F.; Tucker, Katherine L.; Hedner, Thomas; Tuomi, Tiinamaija; Isomaa, Bo; Eriksson, Karl-Fredrik; Taskinen, Marja-Riitta; Wahlstrand, Björn; Hughes, Thomas E.; Parnell, Laurence D.; Lai, Chao-Qiang; Berglund, Göran; Peltonen, Leena; Vartiainen, Erkki; Jousilahti, Pekka; Havulinna, Aki S.; Salomaa, Veikko; Nilsson, Peter; Groop, Leif; Ordovas, Jose M.; Kathiresan, Sekar; Saxena, Richa; de Bakker, Paul; Hirschhorn, Joel; Altshuler, DavidObjective: Using the genome-wide association approach, we recently identified the glucokinase regulatory protein gene (GCKR, rs780094) region as a novel quantitative trait locus for plasma triglyceride concentration in Europeans. Here, we sought to study the association of GCKR variants with metabolic phenotypes, including measures of glucose homeostasis, to evaluate the GCKR locus in samples of non-European ancestry and to fine-map across the associated genomic interval. Research Design and Methods: We performed association studies in 12 independent cohorts comprising >45,000 individuals representing several ancestral groups (whites from Northern and Southern Europe, whites from the U.S., African Americans from the U.S., Hispanics of Caribbean origin, and Chinese, Malays, and Asian Indians from Singapore). We conducted genetic fine-mapping across the ∼417-kb region of linkage disequilibrium spanning GCKR and 16 other genes on chromosome 2p23 by imputing untyped HapMap single nucleotide polymorphisms (SNPs) and genotyping 104 SNPs across the associated genomic interval. Results: We provide comprehensive evidence that GCKR rs780094 is associated with opposite effects on fasting plasma triglyceride (Pmeta = 3 × 10−56) and glucose (Pmeta = 1 × 10−13) concentrations. In addition, we confirmed recent reports that the same SNP is associated with C-reactive protein (CRP) level (P = 5 × 10−5). Both fine-mapping approaches revealed a common missense GCKR variant (rs1260326, Pro446Leu, 34% frequency, r2 = 0.93 with rs780094) as the strongest association signal in the region. Conclusions: These findings point to a molecular mechanism in humans by which higher triglycerides and CRP can be coupled with lower plasma glucose concentrations and position GCKR in central pathways regulating both hepatic triglyceride and glucose metabolism.