Show simple item record

dc.contributor.authorIssa, Shadi A.
dc.contributor.authorKienzler, Romeo
dc.contributor.authorEl-Kalioby, Mohamed
dc.contributor.authorTonellato, Peter J.
dc.contributor.authorWall, Dennis
dc.contributor.authorBruggmann, Rémy
dc.contributor.authorAbouelhoda, Mohamed
dc.date.accessioned2013-12-06T16:02:44Z
dc.date.issued2013
dc.identifier.citationIssa, Shadi A., Romeo Kienzler, Mohamed El-Kalioby, Peter J. Tonellato, Dennis Wall, Rémy Bruggmann, and Mohamed Abouelhoda. 2013. Streaming support for data intensive cloud-based sequence analysis. BioMed Research International 2013:791051.en_US
dc.identifier.issn2314-6133en_US
dc.identifier.urihttp://nrs.harvard.edu/urn-3:HUL.InstRepos:11357459
dc.description.abstractCloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client's site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.en_US
dc.language.isoen_USen_US
dc.publisherHindawi Publishing Corporationen_US
dc.relation.isversionofdoi:10.1155/2013/791051en_US
dc.relation.hasversionhttp://www.ncbi.nlm.nih.gov/pmc/articles/PMC3655485/pdf/en_US
dash.licenseLAA
dc.titleStreaming Support for Data Intensive Cloud-Based Sequence Analysisen_US
dc.typeJournal Articleen_US
dc.description.versionVersion of Recorden_US
dc.relation.journalBioMed Research Internationalen_US
dash.depositing.authorTonellato, Peter J.
dc.date.available2013-12-06T16:02:44Z
dc.identifier.doi10.1155/2013/791051*
dash.contributor.affiliatedWall, Dennis Paul
dash.contributor.affiliatedTonellato, Peter


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record