Publication:
A Near-Optimal Algorithm for L1-Difference

Thumbnail Image

Date

2009

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Nelson, Jelani and David P. Woodruff. 2009. A Near-Optimal Algorithm for L1-Difference. Working paper, School of Engineering and Applied Sciences, Harvard University.

Research Data

Abstract

We give the first L1-sketching algorithm for integer vectors which produces nearly optimal sized sketches in nearly linear time. This answers the first open problem in the list of open problems from the 2006 IITK Workshop on Algorithms for Data Streams. Specifically, suppose Alice receives a vector x ∈ {−M, . . . , M}n and Bob receives y ∈ {−M, . . . , M}n, and the two parties share randomness. Each party must output a short sketch of their vector such that a third party can later quickly recover a (1 ± ε)-approximation to ||x − y||1 with 2/3 probability given only the sketches. We give a sketching algorithm which produces O(ε−2log(1/ε) log(nM))-bit sketches in O(n log2(nM)) time, independent of ε. The previous best known sketching algorithm for L1 is due to [Feigenbaum et al., SICOMP 2002], which achieved the optimal sketch length of O(ε−2log(nM)) bits but had a running time of O(n log(nM)/ε2). Notice that our running time is near-linear for every ε, whereas for sufficiently small values of ε, the running time of the previous algorithm can be as large as quadratic. Like their algorithm, our sketching procedure also yields a small-space, one-pass streaming algorithm which works even if the entries of x, y are given in arbitrary order.

Description

Keywords

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories