Publication: A Near-Optimal Algorithm for L1-Difference
Date
2009
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Nelson, Jelani and David P. Woodruff. 2009. A Near-Optimal Algorithm for L1-Difference. Working paper, School of Engineering and Applied Sciences, Harvard University.
Research Data
Abstract
We give the first L1-sketching algorithm for integer vectors which produces nearly optimal
sized sketches in nearly linear time. This answers the first open problem in the list of open
problems from the 2006 IITK Workshop on Algorithms for Data Streams. Specifically, suppose Alice receives a vector x ∈ {−M, . . . , M}n and Bob receives y ∈ {−M, . . . , M}n, and the two
parties share randomness. Each party must output a short sketch of their vector such that a third
party can later quickly recover a (1 ± ε)-approximation to ||x − y||1 with 2/3 probability given
only the sketches. We give a sketching algorithm which produces O(ε−2log(1/ε) log(nM))-bit
sketches in O(n log2(nM)) time, independent of ε. The previous best known sketching algorithm
for L1 is due to [Feigenbaum et al., SICOMP 2002], which achieved the optimal sketch length
of O(ε−2log(nM)) bits but had a running time of O(n log(nM)/ε2). Notice that our running time is near-linear for every ε, whereas for sufficiently small values of ε, the running time of the previous algorithm can be as large as quadratic. Like their algorithm, our sketching procedure also yields a small-space, one-pass streaming algorithm which works even if the entries of x, y are given in arbitrary order.
Description
Other Available Sources
Keywords
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service