Publication:
A Comparison of Computational Methods for Identifying Virulence Factors

Date

2012

Publisher

Public Library of Science

Citation

Zheng, Lu-Lu, Yi-Xue Li, Juan Ding, Xiao-Kui Guo, Kai-Yan Feng, Ya-Jun Wang, Le-Le Hu, Yu-Dong Cai, Pei Hao, and Kuo-Chen Chou. 2012. A comparison of computational methods for identifying virulence factors. PLoS ONE 7(8): e42517.

Abstract

Bacterial pathogens continue to threaten public health worldwide. Identifying bacterial virulence factors can help to find novel drug and vaccine targets against pathogenicity and to reveal the mechanisms of the related diseases at the molecular level. With the explosive growth in protein sequences generated in the postgenomic age, it is highly desirable to develop computational methods for rapidly and effectively identifying virulence factors according to their sequence information alone. In this study, a novel network-based method, built on the protein-protein interaction networks from the STRING database, was proposed for identifying the virulence factors in the proteomes of UPEC 536, UPEC CFT073, P. aeruginosa PAO1, L. pneumophila Philadelphia 1, C. jejuni NCTC 11168 and M. tuberculosis H37Rv. Evaluated on the same benchmark datasets derived from the aforementioned species, the identification accuracies achieved by the network-based method were around 0.9, significantly higher than those of sequence-based methods such as BLAST, feature selection and VirulentPred. Further analysis showed that functional associations such as gene neighborhood and co-occurrence were the primary associations between these virulence factors in the STRING database. The high success rates indicate that the network-based method is quite promising. The approach can be easily extended to identify virulence factors in many other bacterial species, as long as the relevant significant statistical data are available for them.
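
To make the idea of a network-based predictor concrete, the following is a minimal sketch of one common scheme: a query protein takes the label of its highest-confidence labeled neighbor in an interaction network. The abstract does not specify the authors' exact procedure, so this rule is an assumption, and the protein identifiers, confidence scores, and function names (`build_network`, `predict_virulence`) are hypothetical illustrations rather than STRING data or the paper's code.

```python
from collections import defaultdict


def build_network(edges):
    """Build an undirected weighted adjacency map from (a, b, score) triples."""
    network = defaultdict(dict)
    for a, b, score in edges:
        network[a][b] = score
        network[b][a] = score
    return network


def predict_virulence(protein, network, labels):
    """Return the label of the highest-scoring labeled neighbor.

    Returns None when the protein has no labeled neighbor, in which case
    a real pipeline would need a fallback (for example a sequence-based
    method); that fallback is not shown here.
    """
    best_label, best_score = None, float("-inf")
    for neighbor, score in network.get(protein, {}).items():
        if neighbor in labels and score > best_score:
            best_label, best_score = labels[neighbor], score
    return best_label


# Toy data: hypothetical protein IDs and interaction confidence scores.
edges = [
    ("p1", "p2", 900),
    ("p1", "p3", 400),
    ("p4", "p3", 700),
    ("p5", "p6", 500),
]
# Known annotations: True = virulence factor, False = non-virulence factor.
labels = {"p2": True, "p3": False}
network = build_network(edges)

for query in ("p1", "p4", "p5"):
    print(query, predict_virulence(query, network, labels))
```

In this toy network, `p1` inherits its label from `p2` because that edge has the higher score, while `p5` has no labeled neighbor and is left unpredicted.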

Keywords

Biology, Biochemistry, Proteins, Protein Interactions, Computational Biology, Systems Biology, Microbiology, Bacteriology, Bacterial Biochemistry, Bacterial Pathogens, Proteomics, Medicine, Infectious Diseases, Bacterial Diseases

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service
