Publication:
A Framework for Rapid Active Learning in Resource-Constrained Environmental Sensing Domains

No Thumbnail Available

Date

2022-05-23

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Behari, Nikhil Swaminathan. 2022. A Framework for Rapid Active Learning in Resource-Constrained Environmental Sensing Domains. Bachelor's thesis, Harvard College.

Research Data

Abstract

Recent advancements in deep convolutional neural networks have enabled accurate, efficient, and intelligent feature learning for a wide variety of classification tasks. However, there remains a research to practice gap that has limited the deployment of these vision systems in socially relevant domains. In particular, the prohibitive cost of manual data annotation for training these classification models, and the unique qualities of naturally observed datasets, such as imbalanced, multispectral input features, can limit the ability to develop robust, stable, and generalizable classification models in resource-constrained settings. In this work, we proposed a novel active learning methodology to enable efficient data labeling in these challenging classification settings, accounting for common dataset constraints of socially focused domains such as small sample sizes, rare positive class samples, and multispectral/thermal input features. We additionally test the efficacy and generalizability of our proposed framework in two distinct domains of ecology and disaster management. We find that the proposed approach significantly reduces the number of manual annotations required to develop stable and accurate classification models, as well as rapidly identifies important, rare positive samples through the developed automatic querying procedure. In turn, we find that this reduction in the manual annotation requirements for classification training translates to significant time-saving measures for human annotators, which can enable more ubiquitous, equitable, and accessible deployment of deep learning-based classification models in important social domains.

Description

Other Available Sources

Keywords

active learning, AI for social good, computer vision, environmental sensing, machine learning, resource-constrained, Artificial intelligence, Computer science

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories