Show simple item record

dc.contributor.authorJacquin, Hugoen_US
dc.contributor.authorGilson, Amyen_US
dc.contributor.authorShakhnovich, Eugeneen_US
dc.contributor.authorCocco, Simonaen_US
dc.contributor.authorMonasson, Rémien_US
dc.date.accessioned2016-06-14T18:53:25Z
dc.date.issued2016en_US
dc.identifier.citationJacquin, Hugo, Amy Gilson, Eugene Shakhnovich, Simona Cocco, and Rémi Monasson. 2016. “Benchmarking Inverse Statistical Approaches for Protein Structure and Design with Exactly Solvable Models.” PLoS Computational Biology 12 (5): e1004889. doi:10.1371/journal.pcbi.1004889. http://dx.doi.org/10.1371/journal.pcbi.1004889.en
dc.identifier.issn1553-734Xen
dc.identifier.urihttp://nrs.harvard.edu/urn-3:HUL.InstRepos:27320431
dc.description.abstractInverse statistical approaches to determine protein structure and function from Multiple Sequence Alignments (MSA) are emerging as powerful tools in computational biology. However the underlying assumptions of the relationship between the inferred effective Potts Hamiltonian and real protein structure and energetics remain untested so far. Here we use lattice protein model (LP) to benchmark those inverse statistical approaches. We build MSA of highly stable sequences in target LP structures, and infer the effective pairwise Potts Hamiltonians from those MSA. We find that inferred Potts Hamiltonians reproduce many important aspects of ‘true’ LP structures and energetics. Careful analysis reveals that effective pairwise couplings in inferred Potts Hamiltonians depend not only on the energetics of the native structure but also on competing folds; in particular, the coupling values reflect both positive design (stabilization of native conformation) and negative design (destabilization of competing folds). In addition to providing detailed structural information, the inferred Potts models used as protein Hamiltonian for design of new sequences are able to generate with high probability completely new sequences with the desired folds, which is not possible using independent-site models. Those are remarkable results as the effective LP Hamiltonians used to generate MSA are not simple pairwise models due to the competition between the folds. Our findings elucidate the reasons for the success of inverse approaches to the modelling of proteins from sequence data, and their limitations.en
dc.language.isoen_USen
dc.publisherPublic Library of Scienceen
dc.relation.isversionofdoi:10.1371/journal.pcbi.1004889en
dc.relation.hasversionhttp://www.ncbi.nlm.nih.gov/pmc/articles/PMC4866778/pdf/en
dash.licenseLAAen_US
dc.subjectBiology and Life Sciencesen
dc.subjectMolecular Biologyen
dc.subjectMacromolecular Structure Analysisen
dc.subjectProtein Structureen
dc.subjectBiochemistryen
dc.subjectProteinsen
dc.subjectProtein Structure Predictionen
dc.subjectComputational Techniquesen
dc.subjectSplit-Decomposition Methoden
dc.subjectMultiple Alignment Calculationen
dc.subjectPhysical Sciencesen
dc.subjectPhysicsen
dc.subjectThermodynamicsen
dc.subjectEntropyen
dc.subjectMolecular Biology Techniquesen
dc.subjectSequencing Techniquesen
dc.subjectSequence Analysisen
dc.subjectSequence Alignmenten
dc.subjectProtein Structure Comparisonen
dc.subjectProtein Sequencingen
dc.subjectMathematical and Statistical Techniquesen
dc.subjectStatistical Methodsen
dc.subjectForecastingen
dc.subjectMathematicsen
dc.subjectStatistics (Mathematics)en
dc.titleBenchmarking Inverse Statistical Approaches for Protein Structure and Design with Exactly Solvable Modelsen
dc.typeJournal Articleen_US
dc.description.versionVersion of Recorden
dc.relation.journalPLoS Computational Biologyen
dash.depositing.authorGilson, Amyen_US
dc.date.available2016-06-14T18:53:25Z
dc.identifier.doi10.1371/journal.pcbi.1004889*
dash.contributor.affiliatedGilson, Amy Ilana
dash.contributor.affiliatedShakhnovich, Eugene


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record