Person: Sofer, Tamar
Email Address
AA Acceptance Date
Birth Date
Research Projects
Organizational Units
Job Title
Last Name
First Name
Name
Search Results
Publication Statistical Methods for High Dimensional Data in Environmental Genomics
(2013-02-11) Sofer, Tamar; Lin, Xihong; Coull, Brent; Schwartz, Joel; Cai, TianxiIn this dissertation, we propose methodology to analyze high dimensional genomics data, in which the observations have large number of outcome variables, in addition to exposure variables. In the Chapter 1, we investigate methods for genetic pathway analysis, where we have a small number of exposure variables. We propose two Canonical Correlation Analysis based methods, that select outcomes either sequentially or by screening, and show that the performance of the proposed methods depend on the correlation between the genes in the pathway. We also propose and investigate criterion for fixing the number of outcomes, and a powerful test for the exposure effect on the pathway. The methodology is applied to show that air pollution exposure affects gene methylation of a few genes from the asthma pathway. In Chapter 2, we study penalized multivariate regression as an efficient and flexible method to study the relationship between large number of covariates and multiple outcomes. We use penalized likelihood to shrink model parameters to zero and to select only the important effects. We use the Bayesian Information Criterion (BIC) to select tuning parameters for the employed penalty and show that it chooses the right tuning parameter with high probability. These are combined in the “two-stage procedure”, and asymptotic results show that it yields consistent, sparse and asymptotically normal estimator of the regression parameters. The method is illustrated on gene expression data in normal and diabetic patients. In Chapter 3 we propose a method for estimation of covariates-dependent principal components analysis (PCA) and covariance matrices. Covariates, such as smoking habits, can affect the variation in a set of gene methylation values. We develop a penalized regression method that incorporates covariates in the estimation of principal components. We show that the parameter estimates are consistent and sparse, and show that using the BIC to select the tuning parameter for the penalty functions yields good models. We also propose the scree plot residual variance criterion for selecting the number of principal components. The proposed procedure is implemented to show that the first three principal components of genes methylation in the asthma pathway are different in people who did not smoke, and people who did.
Publication Admixture mapping in the Hispanic Community Health Study/Study of Latinos reveals regions of genetic associations with blood pressure traits
(Public Library of Science, 2017) Sofer, Tamar; Baier, Leslie J.; Browning, Sharon R.; Thornton, Timothy A.; Talavera, Gregory A.; Wassertheil-Smoller, Sylvia; Daviglus, Martha L.; Hanson, Robert; Kobes, Sayuko; Cooper, Richard S.; Cai, Jianwen; Levy, Daniel; Reiner, Alex P.; Franceschini, NoraAdmixture mapping can be used to detect genetic association regions in admixed populations, such as Hispanics/Latinos, by estimating associations between local ancestry allele counts and the trait of interest. We performed admixture mapping of the blood pressure traits systolic and diastolic blood pressure (SBP, DBP), mean arterial pressure (MAP), and pulse pressure (PP), in a dataset of 12,116 participants from the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). Hispanics/Latinos have three predominant ancestral populations (European, African, and Amerindian), for each of which we separately tested local ancestry intervals across the genome. We identified four regions that were significantly associated with a blood pressure trait at the genome-wide admixture mapping level. A 6p21.31 Amerindian ancestry association region has multiple known associations, but none explained the admixture mapping signal. We identified variants that completely explained this signal. One of these variants had p-values of 0.02 (MAP) and 0.04 (SBP) in replication testing in Pima Indians. A 11q13.4 Amerindian ancestry association region spans a variant that was previously reported (p-value = 0.001) in a targeted association study of Blood Pressure (BP) traits and variants in the vitamin D pathway. There was no replication evidence supporting an association in the identified 17q25.3 Amerindian ancestry association region. For a region on 6p12.3, associated with African ancestry, we did not identify any candidate variants driving the association. It may be driven by rare variants. Whole genome sequence data may be necessary to fine map these association signals, which may contribute to disparities in BP traits between diverse populations.
Publication GWAS of the electrocardiographic QT interval in Hispanics/Latinos generalizes previously identified loci and identifies population-specific signals
(Nature Publishing Group UK, 2017) Méndez-Giráldez, Raúl; Gogarten, Stephanie M.; Below, Jennifer E.; Yao, Jie; Seyerle, Amanda A.; Highland, Heather M.; Kooperberg, Charles; Soliman, Elsayed Z.; Rotter, Jerome I.; Kerr, Kathleen F.; Ryckman, Kelli K.; Taylor, Kent D.; Petty, Lauren E.; Shah, Sanjiv J.; Conomos, Matthew P.; Sotoodehnia, Nona; Cheng, Susan; Heckbert, Susan R.; Sofer, Tamar; Guo, Xiuqing; Whitsel, Eric A.; Lin, Henry J.; Hanis, Craig L.; Laurie, Cathy C.; Avery, Christy L.QT interval prolongation is a heritable risk factor for ventricular arrhythmias and can predispose to sudden death. Most genome-wide association studies (GWAS) of QT were performed in European ancestral populations, leaving other groups uncharacterized. Herein we present the first QT GWAS of Hispanic/Latinos using data on 15,997 participants from four studies. Study-specific summary results of the association between 1000 Genomes Project (1000G) imputed SNPs and electrocardiographically measured QT were combined using fixed-effects meta-analysis. We identified 41 genome-wide significant SNPs that mapped to 13 previously identified QT loci. Conditional analyses distinguished six secondary signals at NOS1AP (n = 2), ATP1B1 (n = 2), SCN5A (n = 1), and KCNQ1 (n = 1). Comparison of linkage disequilibrium patterns between the 13 lead SNPs and six secondary signals with previously reported index SNPs in 1000G super populations suggested that the SCN5A and KCNE1 lead SNPs were potentially novel and population-specific. Finally, of the 42 suggestively associated loci, AJAP1 was suggestively associated with QT in a prior East Asian GWAS; in contrast BVES and CAP2 murine knockouts caused cardiac conduction defects. Our results indicate that whereas the same loci influence QT across populations, population-specific variation exists, motivating future trans-ethnic and ancestrally diverse QT GWAS.
Publication Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries
(Public Library of Science, 2018) Feitosa, Mary F.; Kraja, Aldi T.; Chasman, Daniel; Sung, Yun J.; Winkler, Thomas W.; Ntalla, Ioanna; Guo, Xiuqing; Franceschini, Nora; Cheng, Ching-Yu; Sim, Xueling; Vojinovic, Dina; Marten, Jonathan; Musani, Solomon K.; Li, Changwei; Bentley, Amy R.; Brown, Michael R.; Schwander, Karen; Richard, Melissa A.; Noordam, Raymond; Aschard, Hugues; Bartz, Traci M.; Bielak, Lawrence F.; Dorajoo, Rajkumar; Fisher, Virginia; Hartwig, Fernando P.; Horimoto, Andrea R. V. R.; Lohman, Kurt K.; Manning, Alisa; Rankinen, Tuomo; Smith, Albert V.; Tajuddin, Salman M.; Wojczynski, Mary K.; Alver, Maris; Boissel, Mathilde; Cai, Qiuyin; Campbell, Archie; Chai, Jin Fang; Chen, Xu; Divers, Jasmin; Gao, Chuan; Goel, Anuj; Hagemeijer, Yanick; Harris, Sarah E.; He, Meian; Hsu, Fang-Chi; Jackson, Anne U.; Kähönen, Mika; Kasturiratne, Anuradhani; Komulainen, Pirjo; Kühnel, Brigitte; Laguzzi, Federica; Luan, Jian'an; Matoba, Nana; Nolte, Ilja M.; Padmanabhan, Sandosh; Riaz, Muhammad; Rueedi, Rico; Robino, Antonietta; Said, M. Abdullah; Scott, Robert A.; Sofer, Tamar; Stančáková, Alena; Takeuchi, Fumihiko; Tayo, Bamidele O.; van der Most, Peter J.; Varga, Tibor V.; Vitart, Veronique; Wang, Yajuan; Ware, Erin B.; Warren, Helen R.; Weiss, Stefan; Wen, Wanqing; Yanek, Lisa R.; Zhang, Weihua; Zhao, Jing Hua; Afaq, Saima; Amin, Najaf; Amini, Marzyeh; Arking, Dan E.; Aung, Tin; Boerwinkle, Eric; Borecki, Ingrid; Broeckel, Ulrich; Brown, Morris; Brumat, Marco; Burke, Gregory L.; Canouil, Mickaël; Chakravarti, Aravinda; Charumathi, Sabanayagam; Ida Chen, Yii-Der; Connell, John M.; Correa, Adolfo; de las Fuentes, Lisa; de Mutsert, Renée; de Silva, H. Janaka; Deng, Xuan; Ding, Jingzhong; Duan, Qing; Eaton, Charles B.; Ehret, Georg; Eppinga, Ruben N.; Evangelou, Evangelos; Faul, Jessica D.; Felix, Stephan B.; Forouhi, Nita G.; Forrester, Terrence; Franco, Oscar H.; Friedlander, Yechiel; Gandin, Ilaria; Gao, He; Ghanbari, Mohsen; Gigante, Bruna; Gu, C. Charles; Gu, Dongfeng; Hagenaars, Saskia P.; Hallmans, Göran; Harris, Tamara B.; He, Jiang; Heikkinen, Sami; Heng, Chew-Kiat; Hirata, Makoto; Howard, Barbara V.; Ikram, M. Arfan; John, Ulrich; Katsuya, Tomohiro; Khor, Chiea Chuen; Kilpeläinen, Tuomas O.; Koh, Woon-Puay; Krieger, José E.; Kritchevsky, Stephen B.; Kubo, Michiaki; Kuusisto, Johanna; Lakka, Timo A.; Langefeld, Carl D.; Langenberg, Claudia; Launer, Lenore J.; Lehne, Benjamin; Lewis, Cora E.; Li, Yize; Lin, Shiow; Liu, Jianjun; Liu, Jingmin; Loh, Marie; Louie, Tin; Mägi, Reedik; McKenzie, Colin A.; Meitinger, Thomas; Metspalu, Andres; Milaneschi, Yuri; Milani, Lili; Mohlke, Karen L.; Momozawa, Yukihide; Nalls, Mike A.; Nelson, Christopher P.; Sotoodehnia, Nona; Norris, Jill M.; O'Connell, Jeff R.; Palmer, Nicholette D.; Perls, Thomas; Pedersen, Nancy L.; Peters, Annette; Peyser, Patricia A.; Poulter, Neil; Raffel, Leslie J.; Raitakari, Olli T.; Roll, Kathryn; Rose, Lynda M.; Rosendaal, Frits R.; Rotter, Jerome I.; Schmidt, Carsten O.; Schreiner, Pamela J.; Schupf, Nicole; Scott, William R.; Sever, Peter S.; Shi, Yuan; Sidney, Stephen; Sims, Mario; Sitlani, Colleen M.; Smith, Jennifer A.; Snieder, Harold; Starr, John M.; Strauch, Konstantin; Stringham, Heather M.; Tan, Nicholas Y. Q.; Tang, Hua; Taylor, Kent D.; Teo, Yik Ying; Tham, Yih Chung; Turner, Stephen T.; Uitterlinden, André G.; Vollenweider, Peter; Waldenberger, Melanie; Wang, Lihua; Wang, Ya Xing; Wei, Wen Bin; Williams, Christine; Yao, Jie; Yu, Caizheng; Yuan, Jian-Min; Zhao, Wei; Zonderman, Alan B.; Becker, Diane M.; Boehnke, Michael; Bowden, Donald W.; Chambers, John C.; Deary, Ian J.; Esko, Tõnu; Farrall, Martin; Franks, Paul; Freedman, Barry I.; Froguel, Philippe; Gasparini, Paolo; Gieger, Christian; Jonas, Jost Bruno; Kamatani, Yoichiro; Kato, Norihiro; Kooner, Jaspal S.; Kutalik, Zoltán; Laakso, Markku; Laurie, Cathy C.; Leander, Karin; Lehtimäki, Terho; Study, Lifelines Cohort; Magnusson, Patrik K. E.; Oldehinkel, Albertine J.; Penninx, Brenda W. J. H.; Polasek, Ozren; Porteous, David J.; Rauramaa, Rainer; Samani, Nilesh J.; Scott, James; Shu, Xiao-Ou; van der Harst, Pim; Wagenknecht, Lynne E.; Wareham, Nicholas J.; Watkins, Hugh; Weir, David R.; Wickremasinghe, Ananda R.; Wu, Tangchun; Zheng, Wei; Bouchard, Claude; Christensen, Kaare; Evans, Michele K.; Gudnason, Vilmundur; Horta, Bernardo L.; Kardia, Sharon L. R.; Liu, Yongmei; Pereira, Alexandre C.; Psaty, Bruce M.; Ridker, Paul; van Dam, Rob M.; Gauderman, W. James; Zhu, Xiaofeng; Mook-Kanamori, Dennis O.; Fornage, Myriam; Rotimi, Charles N.; Cupples, L. Adrienne; Kelly, Tanika N.; Fox, Ervin R.; Hayward, Caroline; van Duijn, Cornelia M.; Tai, E Shyong; Wong, Tien Yin; Kooperberg, Charles; Palmas, Walter; Rice, Kenneth; Morrison, Alanna C.; Elliott, Paul; Caulfield, Mark J.; Munroe, Patricia B.; Rao, Dabeeru C.; Province, Michael A.; Levy, DanielHeavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impact blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 x 10−5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 x 10−8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 x 10−8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension.
Publication Genetic architecture of lipid traits in the Hispanic community health study/study of Latinos
(BioMed Central, 2017) Graff, Mariaelisa; Emery, Leslie S.; Justice, Anne E.; Parra, Esteban; Below, Jennifer E.; Palmer, Nicholette D.; Gao, Chuan; Duan, Qing; Valladares-Salgado, Adan; Cruz, Miguel; Morrison, Alanna C.; Boerwinkle, Eric; Whitsel, Eric A.; Kooperberg, Charles; Reiner, Alex; Li, Yun; Rodriguez, Carlos Jose; Talavera, Gregory A.; Langefeld, Carl D.; Wagenknecht, Lynne E.; Norris, Jill M.; Taylor, Kent D.; Papanicolaou, George; Kenny, Eimear; Loos, Ruth J. F.; Chen, Yii-Der Ida; Laurie, Cathy; Sofer, Tamar; North, Kari E.Background: Despite ethnic disparities in lipid profiles, there are few genome-wide association studies investigating genetic variation of lipids in non-European ancestry populations. In this study, we present findings from genetic association analyses for total cholesterol, low density lipoprotein cholesterol (LDL), high density lipoprotein cholesterol (HDL), and triglycerides in a large Hispanic/Latino cohort in the U.S., the Hispanic Community Health Study / Study of Latinos (HCHS/SOL). Methods: We estimated a heritability of approximately 20% for each lipid trait, similar to previous estimates in Europeans. To search for novel lipid loci, we performed conditional association analysis in which the statistical model was adjusted for previously reported SNPs associated with any of the four lipid traits. SNPs that remained genome-wide significant (P < 5 × 10−8) after conditioning on known loci were evaluated for replication. Results: We identified eight potentially novel lipid signals with minor allele frequencies <1%, none of which replicated. We tested previously reported SNP-trait associations for generalization to Hispanics/Latinos via a statistical framework. The generalization analysis revealed that approximately 50% of previously established lipid variants generalize to HCHS/SOL based on directional FDR r-value < 0.05. Some failures to generalize were due to lack of power. Conclusions: These results demonstrate that many loci associated with lipid levels are shared across populations. Electronic supplementary material The online version of this article (10.1186/s12944-017-0591-6) contains supplementary material, which is available to authorized users.