Publication:

Efficient and Scalable Tiny Machine Learning

Loading...
Thumbnail Image

Date

2024-07-16

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Banbury, Colby Richard. 2024. Efficient and Scalable Tiny Machine Learning. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.

Abstract

Machine learning (ML) has revolutionized computing, but for ML to become fully ubiquitous, it must be deployed on the billions of ultra-low-power sensing devices that already permeate society. This requires extreme levels of efficiency that are typically achieved through expensive specialization and optimization; however, this is impractical given the diversity of use cases we wish to address. We need scalable solutions that achieve efficiency without incurring substantial engineering costs.

In this thesis, we discuss techniques to achieve extreme efficiency across the ML stack by creating benchmarks, automating model design, and bootstrapping datasets. First, this thesis introduces MLPerf Tiny, a benchmark for ultra-low-power ML (TinyML) hardware, which enables researchers to isolate the impact of individual optimizations and make progress more measurable and tractable. Second, we characterize ML performance on commodity microcontrollers (MCUs) and automate the process of hardware-specific model design with a fast neural architecture search tool. This enables state-of-the-art energy efficiency without substantial manual effort or a computationally expensive search algorithm. Finally, we created Wake Vision, a large, high-quality dataset for TinyML person detection. We demonstrate that we can bootstrap TinyML-relevant datasets through automated data filtering techniques. Additionally, Wake Vision includes a fine-grain benchmark suite to measure the robustness and fairness of a model in challenging settings. The contributions described in this dissertation establish a foundation for future TinyML research and chart a path toward smart, ubiquitous computing.

Description

Other Available Sources

Research Data

Keywords

Efficient AI, Machine Learning, TinyML, Computer science, Computer engineering

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories