Publication:
Tree Preserving Embedding

Thumbnail Image

Date

2011

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

Proceedings of the National Academy of Sciences
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Shieh, Albert D., Tatsunori B. Hashimoto, and Edoardo M. Airoldi. 2011. Tree Preserving Embedding. Proceedings of the National Academy of Sciences 108, no. 41: 16916–16921.

Research Data

Abstract

The goal of dimensionality reduction is to embed high-dimensional data in a low-dimensional space while preserving structure in the data relevant to exploratory data analysis such as clusters. However, existing dimensionality reduction methods often either fail to separate clusters due to the crowding problem or can only separate clusters at a single resolution. We develop a new approach to dimensionality reduction: tree preserving embedding. Our approach uses the topological notion of connectedness to separate clusters at all resolutions. We provide a formal guarantee of cluster separation for our approach that holds for finite samples. Our approach requires no parameters and can handle general types of data, making it easy to use in practice and suggesting new strategies for robust data visualization.

Description

Other Available Sources

Keywords

hierarchical clustering, multidimensional scaling

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories