Publication: A Framework for Rapid Active Learning in Resource-Constrained Environmental Sensing Domains
No Thumbnail Available
Open/View Files
Date
2022-05-23
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Behari, Nikhil Swaminathan. 2022. A Framework for Rapid Active Learning in Resource-Constrained Environmental Sensing Domains. Bachelor's thesis, Harvard College.
Research Data
Abstract
Recent advancements in deep convolutional neural networks have enabled accurate, efficient, and intelligent feature learning for a wide variety of classification tasks. However, there remains a research to practice gap that has limited the deployment of these vision systems in socially relevant domains. In particular, the prohibitive cost of manual data annotation for training these classification models, and the unique qualities of naturally observed datasets, such as imbalanced, multispectral input features, can limit the ability to develop robust, stable, and generalizable classification models in resource-constrained settings. In this work, we proposed a novel active learning methodology to enable efficient data labeling in these challenging classification settings, accounting for common dataset constraints of socially focused domains such as small sample sizes, rare positive class samples, and multispectral/thermal input features. We additionally test the efficacy and generalizability of our proposed framework in two distinct domains of ecology and disaster management. We find that the proposed approach significantly reduces the number of manual annotations required to develop stable and accurate classification models, as well as rapidly identifies important, rare positive samples through the developed automatic querying procedure. In turn, we find that this reduction in the manual annotation requirements for classification training translates to significant time-saving measures for human annotators, which can enable more ubiquitous, equitable, and accessible deployment of deep learning-based classification models in important social domains.
Description
Other Available Sources
Keywords
active learning, AI for social good, computer vision, environmental sensing, machine learning, resource-constrained, Artificial intelligence, Computer science
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service