EMSAR: estimation of transcript abundance from RNA-seq data by mappability-based segmentation and reclustering

Title: EMSAR: estimation of transcript abundance from RNA-seq data by mappability-based segmentation and reclustering
Author: Lee, Soohyun; Seo, Chae Hwa; Alver, Burak Han; Lee, Sanghyuk; Park, Peter J.

Note: Order does not necessarily reflect citation order of authors.

Citation: Lee, Soohyun, Chae Hwa Seo, Burak Han Alver, Sanghyuk Lee, and Peter J. Park. 2015. “EMSAR: estimation of transcript abundance from RNA-seq data by mappability-based segmentation and reclustering.” BMC Bioinformatics 16 (1): 278. doi:10.1186/s12859-015-0704-z. http://dx.doi.org/10.1186/s12859-015-0704-z.
Abstract: Background: RNA-seq has been widely used for genome-wide expression profiling. RNA-seq data typically consists of tens of millions of short sequenced reads from different transcripts. However, due to sequence similarity among genes and among isoforms, the source of a given read is often ambiguous. Existing approaches for estimating expression levels from RNA-seq reads tend to compromise between accuracy and computational cost. Results: We introduce a new approach for quantifying transcript abundance from RNA-seq data. EMSAR (Estimation by Mappability-based Segmentation And Reclustering) groups reads according to the set of transcripts to which they are mapped and finds maximum likelihood estimates using a joint Poisson model for each optimal set of segments of transcripts. The method uses nearly all mapped reads, including those mapped to multiple genes. With an efficient transcriptome indexing based on modified suffix arrays, EMSAR minimizes the use of CPU time and memory while achieving accuracy comparable to the best existing methods. Conclusions: EMSAR is a method for quantifying transcripts from RNA-seq data with high accuracy and low computational cost. EMSAR is available at https://github.com/parklab/emsar. Electronic supplementary material: The online version of this article (doi:10.1186/s12859-015-0704-z) contains supplementary material, which is available to authorized users.
Published Version: doi:10.1186/s12859-015-0704-z
Other Sources: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559005/pdf/
Terms of Use: This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAA
Citable link to this page: http://nrs.harvard.edu/urn-3:HUL.InstRepos:22856980
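Illustration: The abstract describes grouping reads by the set of transcripts they map to and then fitting a joint Poisson model by maximum likelihood. The sketch below is a simplified toy version of that idea, not EMSAR's actual implementation. It assumes that the count for each transcript set s follows Poisson(sum over t of seg_len[s][t] * theta[t]), and it fits theta with a generic EM update. The function names (group_reads, estimate_abundance), the seg_len table of per-set segment lengths, and the toy data are all hypothetical and do not come from the paper or its software.

```python
from collections import Counter


def group_reads(alignments):
    """Count reads per set of transcripts they map to.

    alignments: dict mapping read id -> iterable of transcript ids (hypothetical input).
    Returns a Counter keyed by frozenset of transcript ids.
    """
    return Counter(frozenset(ts) for ts in alignments.values())


def estimate_abundance(counts, seg_len, n_iter=200):
    """Toy maximum likelihood fit by EM.

    Assumed model (an illustration, not taken from the paper):
        counts[s] ~ Poisson(sum_t seg_len[s][t] * theta[t])
    seg_len[s][t] is the length of transcript t's segment shared by exactly set s.
    Every key of counts must appear in seg_len, and every transcript needs
    a positive total segment length.
    """
    transcripts = sorted({t for lens in seg_len.values() for t in lens})
    total_len = {
        t: sum(lens.get(t, 0) for lens in seg_len.values()) for t in transcripts
    }
    theta = {t: 1.0 for t in transcripts}
    for _ in range(n_iter):
        assigned = {t: 0.0 for t in transcripts}
        for s, n in counts.items():
            # E-step: split the n reads of set s in proportion to expected rates.
            rates = {t: length * theta[t] for t, length in seg_len[s].items()}
            denom = sum(rates.values())
            if denom == 0:
                continue
            for t, r in rates.items():
                assigned[t] += n * r / denom
        # M-step: reads per unit length for each transcript.
        theta = {t: assigned[t] / total_len[t] for t in transcripts}
    return theta


# Toy data: 200 reads unique to A, 100 unique to B, 300 shared by A and B.
alignments = {}
for i in range(200):
    alignments[f"a{i}"] = ["A"]
for i in range(100):
    alignments[f"b{i}"] = ["B"]
for i in range(300):
    alignments[f"ab{i}"] = ["A", "B"]

seg_len = {
    frozenset(["A"]): {"A": 100},
    frozenset(["B"]): {"B": 100},
    frozenset(["A", "B"]): {"A": 100, "B": 100},
}

counts = group_reads(alignments)
theta = estimate_abundance(counts, seg_len)
print({t: round(v, 3) for t, v in theta.items()})
```

In this toy setup the shared reads are split between A and B according to the current estimates, so multi-mapped reads contribute to the fit rather than being discarded, which mirrors the abstract's point that nearly all mapped reads are used.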