Publication:

Self-Structuring Machine Learning Models

Loading...
Thumbnail Image

Date

2021-06-23

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Contreras, Nathan. 2021. Self-Structuring Machine Learning Models. Bachelor's thesis, Harvard College.

Abstract

Machine learning models often use a large number of parameters in order to effectively approximate a function of interest on the entirety of its domain, leading to overfitting and high computational requirements. Towards the goal of reducing the number of parameters needed for good approximation, we introduce a new inference method in which a group of machine learning models self-assemble into a chain to collectively produce a single output for a given input. Each model in the chain takes input from the preceding model, produces an output, and adds another model to the chain to receive this output. The reuse of common units in each inference chain presents an opportunity for parameter count savings. In addition, we introduce an efficiency-aware metric that quantifies the predictive power of a group of models.

Description

Other Available Sources

Research Data

Keywords

Machine Learning, Neural Networks, Computer science, Artificial intelligence, Applied mathematics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories