Publication: Tree Preserving Embedding
Open/View Files
Date
2011
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
Proceedings of the National Academy of Sciences
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Shieh, Albert D., Tatsunori B. Hashimoto, and Edoardo M. Airoldi. 2011. Tree Preserving Embedding. Proceedings of the National Academy of Sciences 108, no. 41: 16916–16921.
Research Data
Abstract
The goal of dimensionality reduction is to embed high-dimensional data in a low-dimensional space while preserving structure in the data relevant to exploratory data analysis such as clusters. However, existing dimensionality reduction methods often either fail to separate clusters due to the crowding problem or can only separate clusters at a single resolution. We develop a new approach to dimensionality reduction: tree preserving embedding. Our approach uses the topological notion of connectedness to separate clusters at all resolutions. We provide a formal guarantee of cluster separation for our approach that holds for finite samples. Our approach requires no parameters and can handle general types of data, making it easy to use in practice and suggesting new strategies for robust data visualization.
Description
Other Available Sources
Keywords
hierarchical clustering, multidimensional scaling
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service