Publication:

Group Symmetries in Diffusion Models: Formulation, Generalization, and Enforcement

Loading...
Thumbnail Image

Date

2025-05-22

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Bharthulwar, Siddharth. 2025. Group Symmetries in Diffusion Models: Formulation, Generalization, and Enforcement. Bachelors Thesis, Harvard University Engineering and Applied Sciences.

Abstract

Group symmetries are fundamental structures in many real-world datasets, and lever- aging them is crucial for building robust and data-efficient machine learning models. Diffusion models have achieved state-of-the-art performance in generative tasks but typically rely on standard neural network architectures for score estimation. This thesis investigates whether such standard configurations enable diffusion models to implicitly learn and generalize underlying data symmetries purely from examples, particularly when data is partially observed. Drawing motivation from Neural Tangent Kernel (NTK) theory, which suggests limitations in the ability of standard supervised networks to generalize symmetries beyond local data structure, we hypothesize and empirically demonstrate that score networks in diffusion models exhibit similar constraints. Using a 2D toy dataset with inherent SO(2) rotational symmetry, we show a consistent failure of standard models trained on incomplete data (interpolation and extrapolation settings) to generalize symmetry, exhibiting significant score field distortions in unobserved regions. To address this limitation, we propose and evaluate a novel per-timestep symmetry loss that regularizes the denoising process to encourage approximate equivariance. Empirical results on both the toy dataset and higher-dimensional MNIST data confirm that this loss significantly enhances symmetry generalization even in standard architectures, yielding geometrically consistent results comparable to extensive data augmentation. This work highlights a critical limitation in standard diffusion models and underscores the importance of incorporating explicit geometric biases, via architecture or regularization, for reliable generative modeling on structured data.

Description

Other Available Sources

Research Data

Keywords

diffusion models, equivariance, generalization, inductive bias, symmetry, Artificial intelligence, Statistics, Mathematics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories