Assessing the performance of a suite of machine learning models for daily river water temperature prediction

PeerJ. 2019 Jun 4:7:e7065. doi: 10.7717/peerj.7065. eCollection 2019.

Abstract

In this study, different versions of feedforward neural network (FFNN), Gaussian process regression (GPR), and decision tree (DT) models were developed to estimate daily river water temperature using air temperature (Ta ), flow discharge (Q), and the day of year (DOY) as predictors. The proposed models were assessed using observed data from eight river stations, and modelling results were compared with the air2stream model. Model performances were evaluated using four indicators in this study: the coefficient of correlation (R), the Willmott index of agreement (d), the root mean squared error (RMSE), and the mean absolute error (MAE). Results indicated that the three machine learning models had similar performance when only Ta was used as the predictor. When the day of year was included as model input, the performances of the three machine learning models dramatically improved. Including flow discharge instead of day of year, as an additional predictor, provided a lower gain in model accuracy, thereby showing the relatively minor role of flow discharge in river water temperature prediction. However, an increase in the relative importance of flow discharge was noticed for stations with high altitude catchments (Rhône, Dischmabach and Cedar) which are influenced by cold water releases from hydropower or snow melting, suggesting the dependence of the role of flow discharge on the hydrological characteristics of such rivers. The air2stream model outperformed the three machine learning models for most of the studied rivers except for the cases where including flow discharge as a predictor provided the highest benefits. The DT model outperformed the FFNN and GPR models in the calibration phase, however in the validation phase, its performance slightly decreased. In general, the FFNN model performed slightly better than GPR model. In summary, the overall modelling results showed that the three machine learning models performed well for river water temperature modelling.

Keywords: Air temperature; Artificial neural network; Decision tree; Flow discharge; Gaussian process regression; River water temperature.

Grants and funding

This work was jointly funded by the National Key R&D Program of China (2018YFC0407200), the China Postdoctoral Science Foundation (2018M640499), and the research project from Nanjing Hydraulic Research Institute (Y118009). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.