Publication:

Large Language Models and How We Train Them

Loading...
Thumbnail Image

Date

2025-05-16

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Personnat, Jayden Egbert. 2025. Large Language Models and How We Train Them. Bachelors Thesis, Harvard University Engineering and Applied Sciences.

Abstract

Large language models (LLMs) are the technological revolution of the decade, yet the mechanisms behind how they were developed or how they work fundamentally are often misunderstood by the public—and even debated among AI researchers and industry leaders—despite their popularity. As a result, for many individuals, these models have become unexplainable black boxes, which can lead to misuse or over-reliance on LLMs as people fail to grasp how they work or what their current limitations are. Due to a dramatic increase in AI discussion over the last five years, there is also the problem of information overload. There are many resources on AI from academic papers, blogs, news articles, and more that cover many different topics in a vast literature, but few unify these ideas into a coherent framework that explains how modern LLMs are built and trained. Many resources are also inaccessible and assume deep background knowledge, or focus on narrow implementation-level and mathematical details. This thesis is a structured, theoretical introduction to large language models that aims to combine theory with intuitive explanations and bridge the gap between understanding AI and its usage.

Description

Other Available Sources

Research Data

Keywords

Deep Learning, Large Language Models, Machine Learning, Reinforcement Learning, RLHF, Artificial intelligence

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories