Publication:
Random Forests for Global and Regional Crop Yield Predictions

Thumbnail Image

Open/View Files

Date

2016

Journal Title

Journal ISSN

Volume Title

Publisher

Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Jeong, J. H., J. P. Resop, N. D. Mueller, D. H. Fleisher, K. Yun, E. E. Butler, D. J. Timlin, et al. 2016. “Random Forests for Global and Regional Crop Yield Predictions.” PLoS ONE 11 (6): e0156571. doi:10.1371/journal.pone.0156571. http://dx.doi.org/10.1371/journal.pone.0156571.

Research Data

Abstract

Accurate predictions of crop yield are critical for developing effective agricultural and food policies at the regional and global scales. We evaluated a machine-learning method, Random Forests (RF), for its ability to predict crop yield responses to climate and biophysical variables at global and regional scales in wheat, maize, and potato in comparison with multiple linear regressions (MLR) serving as a benchmark. We used crop yield data from various sources and regions for model training and testing: 1) gridded global wheat grain yield, 2) maize grain yield from US counties over thirty years, and 3) potato tuber and maize silage yield from the northeastern seaboard region. RF was found highly capable of predicting crop yields and outperformed MLR benchmarks in all performance statistics that were compared. For example, the root mean square errors (RMSE) ranged between 6 and 14% of the average observed yield with RF models in all test cases whereas these values ranged from 14% to 49% for MLR models. Our results show that RF is an effective and versatile machine-learning method for crop yield predictions at regional and global scales for its high accuracy and precision, ease of use, and utility in data analysis. RF may result in a loss of accuracy when predicting the extreme ends or responses beyond the boundaries of the training data.

Description

Keywords

Biology and Life Sciences, Agriculture, Crop Science, Crops, Cereal Crops, Maize, Organisms, Plants, Grasses, Model Organisms, Plant and Algal Models, Wheat, Solanum, Potato, Vegetables, Plant Science, Plant Anatomy, Tubers, Agrochemicals, Fertilizers, Mathematical and Statistical Techniques, Statistical Methods, Regression Analysis, Linear Regression Analysis, Physical Sciences, Mathematics, Statistics (Mathematics), Agricultural Soil Science, Ecology and Environmental Sciences, Soil Science

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories