Publication: An improved predictive recognition model for Cys2-His2 zinc finger proteins
Date
2014
Publisher
Oxford University Press
Citation
Gupta, A., R. G. Christensen, H. A. Bell, M. Goodwin, R. Y. Patel, M. Pandey, M. S. Enuameh, et al. 2014. “An improved predictive recognition model for Cys2-His2 zinc finger proteins.” Nucleic Acids Research 42 (8): 4800-4812. doi:10.1093/nar/gku132. http://dx.doi.org/10.1093/nar/gku132.
Abstract
Cys2-His2 zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regard to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities.
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service