Publication: Computational identification of the selenocysteine tRNA (tRNASec) in genomes
Open/View Files
Date
2017
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Santesmasses, Didac, Marco Mariotti, and Roderic Guigó. 2017. “Computational identification of the selenocysteine tRNA (tRNASec) in genomes.” PLoS Computational Biology 13 (2): e1005383. doi:10.1371/journal.pcbi.1005383. http://dx.doi.org/10.1371/journal.pcbi.1005383.
Research Data
Abstract
Selenocysteine (Sec) is known as the 21st amino acid, a cysteine analogue with selenium replacing sulphur. Sec is inserted co-translationally in a small fraction of proteins called selenoproteins. In selenoprotein genes, the Sec specific tRNA (tRNASec) drives the recoding of highly specific UGA codons from stop signals to Sec. Although found in organisms from the three domains of life, Sec is not universal. Many species are completely devoid of selenoprotein genes and lack the ability to synthesize Sec. Since tRNASec is a key component in selenoprotein biosynthesis, its efficient identification in genomes is instrumental to characterize the utilization of Sec across lineages. Available tRNA prediction methods fail to accurately predict tRNASec, due to its unusual structural fold. Here, we present Secmarker, a method based on manually curated covariance models capturing the specific tRNASec structure in archaea, bacteria and eukaryotes. We exploited the non-universality of Sec to build a proper benchmark set for tRNASec predictions, which is not possible for the predictions of other tRNAs. We show that Secmarker greatly improves the accuracy of previously existing methods constituting a valuable tool to identify tRNASec genes, and to efficiently determine whether a genome contains selenoproteins. We used Secmarker to analyze a large set of fully sequenced genomes, and the results revealed new insights in the biology of tRNASec, led to the discovery of a novel bacterial selenoprotein family, and shed additional light on the phylogenetic distribution of selenoprotein containing genomes. Secmarker is freely accessible for download, or online analysis through a web server at http://secmarker.crg.cat.
Description
Other Available Sources
Keywords
Biology and life sciences, Biochemistry, Nucleic acids, RNA, Non-coding RNA, Transfer RNA, Biology and Life Sciences, Genetics, Genomics, Animal Genomics, Invertebrate Genomics, Database and Informatics Methods, Bioinformatics, Sequence Analysis, Sequence Alignment, Fungal Genetics, Fungal Genomics, Mycology, Biological Databases, Genomic Databases, Computational Biology, Genome Analysis, Microbiology, Archaean Biology, Gene Prediction, Computational Techniques, Split-Decomposition Method, Multiple Alignment Calculation
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service