Publication:

Streaming Algorithms for Code Sparsification

Loading...
Thumbnail Image

Date

2025-05-16

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Kuchlous, Sahil. 2025. Streaming Algorithms for Code Sparsification. Bachelors Thesis, Harvard University Engineering and Applied Sciences.

Abstract

The rapid growth of data collection and analysis in the 21st century has prompted the development of several new techniques in theoretical computer science. The streaming model, in which an algorithm has sequential access to a stream of input data and can only store a small fraction of the input in random access memory, has emerged as a powerful approach for processing datasets that are too large to store on a single system. From the perspective of algorithmic techniques, on the other hand, sparsification algorithms allow for the compression of large structures like graphs while preserving important global properties. These two ideas have a rich and interconnected history, and streaming algorithms for graph and hypergraph sparsification have become essential tools in the processing of massive datasets like social networks. Recently, a new framework called code sparsification has been developed as a method of generalizing existing sparsification algorithms through the lens of compressing matrices over finite fields. However, existing works only describe how to implement code sparsification in the classical computational setting, limiting its power. In this thesis, we develop the first algorithms for code sparsification in the insertion-only and dynamic streaming models. This provides a unified theory of graph and hypergraph sparsification over streams, and extends these techniques to allow for the space-efficient sparsification of Cayley graphs and a subset of constraint satisfaction problems.

Description

Other Available Sources

Research Data

Keywords

Computer science, Mathematics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories