Publication:

On Nonparametric Guidance for Learning Autoencoder Representations

Loading...
Thumbnail Image

Date

2012

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

Microtome Publishing
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Snoek, Jasper, Ryan P. Adams, and Hugo Larochelle. 2012. On nonparametric guidance for learning autoencoder representations. In Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics: April 21-23, 2012 La Palma, Canary Islands, ed. Neil Lawrence and Mark Girolami, JMLR Workshop and Conference Proceedings 22:1073-1080.

Abstract

Unsupervised discovery of latent representations, in addition to being useful for density modeling, visualisation and exploratory data analysis, is also increasingly important for learning features relevant to discriminative tasks. Autoencoders, in particular, have proven to be an effective way to learn latent codes that reflect meaningful variations in data. A continuing challenge, however, is guiding an autoencoder toward representations that are useful for particular tasks. A complementary challenge is to find codes that are invariant to irrelevant transformations of the data. The most common way of introducing such problem-specific guidance in autoencoders has been through the incorporation of a parametric component that ties the latent representation to the label information. In this work, we argue that a preferable approach relies instead on a nonparametric guidance mechanism. Conceptually, it ensures that there exists a function that can predict the label information, without explicitly instantiating that function. The superiority of this guidance mechanism is con- firmed on two datasets. In particular, this approach is able to incorporate invariance information (lighting, elevation, etc.) from the small NORB object recognition dataset and yields state-of-the-art performance for a single layer, non-convolutional network.

Description

Other Available Sources

Research Data

Keywords

Terms of Use

This article is made available under the terms and conditions applicable to Open Access Policy Articles (OAP), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories