Publication:

Learning in Neural Networks: Lazy training, Feature Learning, and Fine-Tuning

Loading...
Thumbnail Image

Date

2025-05-16

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Dayi, Arif Kerem. 2025. Learning in Neural Networks: Lazy training, Feature Learning, and Fine-Tuning. Bachelors Thesis, Harvard University Engineering and Applied Sciences.

Abstract

Neural networks trained on large amounts of data have found groundbreaking applications in language modeling, vision, and many other fields. The modern machine learning pipeline usually involves pre-training a model on a large, diverse dataset, and post-trained (e.g. fine tuned) on specialized downstream tasks. Models are able to learn good representations of the data in the pre-training stage, which is later tuned in the post-training stage. Despite the vast success of this pipeline, the exact mechanisms by which models are able to adapt their features to downstream tasks remains poorly understood.

In this thesis, we initially explore existing theoretical work on understanding questions related to over parametrization, generalization, and representation learning. To that end, we survey the literature on various mathematical techniques to answer these questions ranging from the neural tangent kernel and mean-field method to the drift martingale analysis; We do this while presenting original insights using self-contained examples and proofs.

Finally, we present original work on low-rank fine-tuning, which establishes a separation between the other learning regimes in the literature. In particular, we show that while fine-tuning is different than lazy training, it has a significantly lower sample and iteration complexity than full feature learning.

Description

Other Available Sources

Research Data

Keywords

feature learning, fine tuning, lazy training, machine learning, Neural networks, Computer science, Mathematics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories