Publication:

Whispers in the Data: Exploring Advice-Driven Sketching Algorithms

Loading...
Thumbnail Image

Date

2025-05-16

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Cui, Anthony. 2025. Whispers in the Data: Exploring Advice-Driven Sketching Algorithms. Bachelors Thesis, Harvard University Engineering and Applied Sciences.

Abstract

In modern systems, sketching algorithms are a common tool to analyze vast amounts of data. Recent work has explored ways to apply an advice model on top of these algorithms, where machine learning models provide hints when processing elements. This enables these sketching algorithms to provide even better performance as they are guided by the models.

This thesis focuses on these sketching algorithms that use advice. I examine and consider the limitations of existing algorithms, then design a new sketching framework to work around these limitations, which I dub the Bucketing sketch. Using this framework, I focus on three key computational challenges: estimating high-frequency moments for increment-only data streams, handling turnstile streams, and quantile estimation for aggregate values. For high-frequency moments (defined as $f(\boldx) = \sum x^d$ for $d > 2$), I prove that my approach can achieve precise estimates using logarithmic space, requiring only minimal assumptions about the underlying data-generating process. The remaining two problems are challenging for existing sketching algorithms; nevertheless, I show strong experimental results for all three problems, highlighting the flexibility and power of the Bucketing sketch.

Description

Other Available Sources

Research Data

Keywords

Computer science, Statistics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories