Paper
Document
Download
Flag content
0

Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China

0
TipTip
Save
Document
Download
Flag content

Abstract

The knowledge of global solar radiation (H) is a prerequisite for the use of renewable solar energy, but H measurements are always not available due to high costs and technical complexities. The present study proposes two machine learning algorithms, i.e. Support Vector Machine (SVM) and a novel simple tree-based ensemble method named Extreme Gradient Boosting (XGBoost), for accurate prediction of daily H using limited meteorological data. Daily H, maximum and minimum air temperatures (Tmax and Tmin), transformed precipitation (Pt, 1 for rainfall > 0 and 0 for rainfall = 0) and extra-terrestrial solar radiation (H0) during 1966–2000 and 2001–2015 from three radiation stations in humid subtropical China were used to train and test the models, respectively. Two combinations of input parameters, i.e. (i) only Tmax, Tmin and Ra, and (ii) complete data were considered for simulations. The proposed machine learning models were also compared with four well-known empirical models to evaluate their performances. The results suggest that the SVM and XGBoost models outperformed the selected empirical models. The performance of the machine learning models was improved by 5.9–12.2% for training phase and by 8.0–11.5% for testing phase in terms of RMSE when information of precipitation was further included. Compared with the SVM model, the XGBoost model generally showed better performance for training phase, and slightly weaker but comparable performance for testing phase in terms of accuracy. However, the XGBoost model was more stable with average increase of 6.3% in RMSE, compared to 10.5% for the SVM algorithm. Also, the XGBoost model (3.02 s and 0.05 s for training and testing phase, respectively) showed much higher computation speed than the SVM model (27.48 s and 4.13 s for training and testing phase, respectively). By jointly considering the prediction accuracy, model stability and computational efficiency, the XGBoost model is highly recommended to estimate daily H using commonly available temperature and precipitation data with excellent performance in humid subtropical climates.

Paper PDF

This paper's license is marked as closed access or non-commercial and cannot be viewed on ResearchHub. Visit the paper's external site.