Publication:

Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm

Loading...
Thumbnail Image

Open/View Files

Date

2016

Journal Title

Journal ISSN

Volume Title

Publisher

Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Xu, Yaofang, Jiayi Wu, Chang-Cheng Yin, and Youdong Mao. 2016. “Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.” PLoS ONE 11 (12): e0167765. doi:10.1371/journal.pone.0167765. http://dx.doi.org/10.1371/journal.pone.0167765.

Abstract

In single-particle cryo-electron microscopy (cryo-EM), K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate unsupervised classification in order to separate molecular projections of distinct orientations. Due to background noise in single-particle images and uncertainty of molecular orientations, traditional K-means clustering algorithm may classify images into wrong classes and produce classes with a large variation in membership. Overcoming these limitations requires further development on clustering algorithms for cryo-EM data analysis. We propose a novel unsupervised data clustering method building upon the traditional K-means algorithm. By introducing an adaptive constraint term in the objective function, our algorithm not only avoids a large variation in class sizes but also produces more accurate data clustering. Applications of this approach to both simulated and experimental cryo-EM data demonstrate that our algorithm is a significantly improved alterative to the traditional K-means algorithm in single-particle cryo-EM analysis.

Description

Research Data

Keywords

Microscopy, Electron Microscopy, Electron Cryo-Microscopy, Physical Sciences, Mathematics, Applied Mathematics, Algorithms, Clustering Algorithms, Simulation and Modeling, Computational Techniques, Split-Decomposition Method, Multiple Alignment Calculation, Biology and Life Sciences, Immunology, Immune System Proteins, Inflammasomes, Medicine and Health Sciences, Biochemistry, Proteins, Chemistry, Polymer Chemistry, Macromolecules, Computer and Information Sciences, Data Visualization, Infographics, Graphs

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories