A Comparative Analysis of the Hybrid SVR-ACOR-Holt-Winters Model and the Whale-Optimized GEP Model for Dust Storm Prediction: A case study of Khuzestan Province

Document Type : Full length article

Authors

Department of Reclamation of arid and mountainous regions Engineering, Faculty of Natural Resources, University of Tehran, Karaj, Iran

Abstract

ABSTRACT
Khuzestan Province, as one of the main dust storm hotspots in Iran, is consistently confronted with serious challenges to public health and environmental sustainability due to its unique geographical characteristics. This study introduces a hybrid SVR-ACOR-Holt-Winters model to simulate dust storms in Khuzestan over a 50-year statistical period (1971–2020). The results reveal that the SVR-ACOR-Holt-Winters model outperforms the individual and dual hybrid models evaluated, achieving superior performance metrics, including a Root Mean Square Error (RMSE) between 0.293 and 0.317, a correlation coefficient (R) ranging from 0.849 to 0.873, and a Mean Absolute Error (MAE) between 0.275 and 0.293. Among all evaluated stations, Abadan demonstrated the highest prediction accuracy, which can be attributed to the influence of climatic variables such as extreme temperature and wind speed on increasing the frequency index of dust storm days in Khuzestan. These findings suggest that dust storm modeling yields higher accuracy in critical regions with greater dust event intensity. The outcomes of this research can be effectively utilized for dust storm modeling and the design of early warning systems, contributing to the mitigation of damages caused by such hazardous environmental phenomen.
Extended Abstract
Introduction
This study examined the performance of the triple-hybrid SVR-ACOR-HoltWinters metamodel for predicting the FDSD index in Khuzestan Province. The results were compared with those of the GEP-WOA hybrid model. All tested models showed the highest accuracy and efficiency during the first and second seasonal combinations across seven stations. Using data from one or two prior seasons improved the accuracy and performance of the predictions. Among the models, the proposed triple-hybrid metamodel was the most accurate and efficient, making it the best method for predicting dust storms in Khuzestan Province. On the other hand, the SVR-HoltWinters hybrid model had the lowest performance, with the smallest NS value and the highest MAE. This indicates that creating dual-hybrid models does not necessarily improve the accuracy of FDSD index predictions. The results of this study can help in better modeling of dust storms and support managerial decisions to reduce the damages caused by this phenomenon.
 
Methodology
The goal is to predict the frequency of dust storm days at seven meteorological stations in Khuzestan Province (Dezful, Safiabad, Masjed Soleyman, Bostan, Ahvaz, Bandar Mahshahr, and Abadan) over a 50-year period (1971–2020). As defined by the World Meteorological Organization, a dust storm day is when at least one of the eight synoptic observations includes a dust-related code (06, 07, 08, 09, 30, 31, 32, 33, 34, 35, or 98) in the present weather report. Additionally, the horizontal visibility associated with the reported code must be less than 1000 meters. In this study, hourly data on horizontal visibility and standardized WMO codes indicating visibility below 1000 meters were used to identify dust storms for all relevant codes.
 
Results and Discussion
The simulation results of dust storms in Khuzestan Province using the SVR algorithm show that the model performs more accurately in combinations one and two compared to other combinations. At the Bostan, Masjed Soleyman, and Dezful stations, the RMSE values decreased from 0.476, 0.481, and 0.491 days in combination four to 0.473, 0.477, and 0.488 days, respectively, after applying data from the previous season to predict dust storms in future seasons. Therefore, it can be concluded that simple models using the Support Vector Regression algorithm provide more accurate predictions of the FDSD index in Khuzestan Province. The results of the Holt-Winters model for predicting the FDSD index during the training and testing phases indicate better performance in all stations when using the first and second prediction scenarios compared to other scenarios. For instance, at the Masjed Soleyman station, the Nash-Sutcliffe efficiency coefficient decreased from 0.436 and 0.435 in combinations 2 and 1 to 0.438 and 0.437 in combinations 4 and 3. A comparison of the results from modeling the frequency of dust storm days using the standalone Support Vector Regression model and the dual-hybrid SVR-ACOR model shows improved efficiency and accuracy in the developed hybrid model. Therefore, when combined with the continuous Ant Colony Optimization catalyst, this model shows improved adaptability to variable and non-stationary data, resulting in higher accuracy in predicting the FDSD index. For instance, at the Bandar Mahshahr station, the RMSE and MAE values decreased from 0.439 and 0.422, respectively, in the Holt-Winters model to 0.406 and 0.398 after applying the dual-hybrid Holt-Winters-ACOR model. The triple-hybrid SVR-ACOR-HoltWinters model is capable of modeling both nonlinear and seasonal patterns of dust storm trends. In this regard, the developed hybrid model showed its best performance in the first and second prediction stages. Thus, it can be concluded that using past seasons to model future trends does not provide optimal results. A comparison of the dust storm prediction results using the SVR-HoltWinters model at the Safiabad station in the first seasonal combination shows a significant reduction in the RMSE value, from 0.501 to 0.307 days, in the optimal combination 1 of the triple-hybrid model.
Conclusion
Dust storms are one of the prominent environmental and climatic challenges in arid and semi-arid regions of Iran and the world, particularly in Khuzestan Province. This phenomenon has widespread impacts on human health, agriculture, infrastructure, quality of life, and the balance of natural ecosystems. Addressing this challenge requires the adoption of comprehensive and multidimensional approaches. The aim of this study was to evaluate the performance of the triple-hybrid model SVR-ACOR-HoltWinters and compare it with the individual models SVR, HoltWinters, and dual-hybrid models SVR-ACOR, HoltWinters-ACOR, and SVR-HoltWinters, as well as to analyze and compare the results with the gene expression programming model optimized by the whale optimization algorithm for predicting the frequency of dust storm days in seven meteorological stations across Khuzestan province.The results showed that the triple-hybrid model SVR-ACOR-HoltWinters demonstrated the best performance among all the models tested in predicting the frequency of dust storm days at all stations. The findings indicated an improvement in the performance of all models when using one or two prior seasons for dust storm prediction. It was thus clear that using older seasons did not improve the models' efficiency or accuracy. In this regard, simple models played a significant role in improving the evaluation metrics. Abadan station had the highest number of dust storm days in the seasonal scale, while Dezful station had the lowest frequency. It was evident that as the FDSD index increased at the stations, the performance of the triple-hybrid and individual and dual-hybrid models also improved. The results of the dual-hybrid models revealed an increase in the performance of the SVR and HoltWinters models after combining them with the continuous Ant Colony Optimization catalyst. Based on the results from the goodness-of-fit metrics, the dual-hybrid HoltWinters-ACOR model ranked second as the best dust storm prediction model in Khuzestan Province. It is also important to note that the gene expression programming model, after being integrated with the whale optimization algorithm, provided relatively better results compared to other dual-hybrid models, securing the third position. Furthermore, the results from the dual-hybrid SVR-HoltWinters model showed a reduction in performance and accuracy when predicting the frequency of dust storm days in Khuzestan compared to the other individual and hybrid models studied in this research.
 
Funding
There is no funding support.
 
Authors’ Contribution
All of the authors approved the content of the manuscript and agreed on all aspects of the work.
 
Conflict of Interest
Authors declared no conflict of interest.
 
Acknowledgments
We are grateful to all the scientific consultants of this paper.

Keywords

Main Subjects


  1. Abdolshahnejad, M., Khosravi, H., Nazari Samani, A., Zehtabian, G. & Alambaigi, A. (2020). Determining the Conceptual Framework of Dust Risk Based on Evaluating Resilience (Case Study: Southwest of Iran). Strategic Research Journal of Agricultural Sciences and Natural Resources5(1), 33-44. https://doi.org/10.22047/srjasnr.2020.113050 [In Persian]
  2. Alfarisi, W. W., Haura, Z., Pratiwi, D. A., Putri, F. A., Sadewo, E., & Darmawan, G. (2024). Comparative Analysis of Fourier Series Analysis and Holtwinters Methods on Forecasting Additive Seasonal Data. Innovative: Journal of Social Science Research4(1), 1310-1325.
  3. Andriani, N., Wahyuningsih, S., & Siringoringo, M. (2022). Application of Double Exponential Smoothing Holt and Triple Exponential Smoothing Holt-Winter with Golden Section Optimization to Forecast Export Value of East Borneo Province. Jurnal Matematika, Statistika Dan Komputasi18(3), 475-483. https://doi.org/10.20956/j.v18i3.17492
  4. Ansari Ghojghar, M., Pourgholam-Amiji, M., Araghinejad, S., Babaeian, I., Liaghat, A. and Salajegheh, A. (2021). The Effect of Warm Phase of ENSO Phenomenon on the Formation of Dust Storms in Khuzestan and Sistan and Baluchestan Provinces. Journal of Range and Watershed Managment, 74(2), 257-271. https://doi.org/10.22059/jrwm.2020.310647.1533 [In Persian]
  5. Ansari ghojghar, M., Pourgholam-Amiji, M., Bazrafshan, J., Liaghat, A. & Araghinejad, S. (2020). Performance Comparison of Statistical, Fuzzy and Perceptron Neural Network Models in Forecasting Dust Storms in Critical Regions in Iran. Iranian Journal of Soil and Water Research51(8), 2051-2063. https://doi.org/10.22059/ijswr.2020.302529.668607 [In Persian]
  6. Bhoopathi, S., Kumar, N., & Pal, M. (2024). Evaluating the performances of SVR and XGBoost for short-range forecasting of heatwaves across different temperature zones of India. Applied Computing and Geosciences24, 100204. https://doi.org/10.1016/j.acags.2024.100204
  7. Borhani, F., Pourezzat, A. A., & Ehsani, A. H. (2024). Spatial distribution of Particulate Matter in Iran from Internal factors to the role of western adjacent countries from political governance to Environmental Governance. Earth Systems and Environment8(1), 135-164. https://doi.org/10.1007/s41748-023-00365-x
  8. Dehshiri, S. S. H., & Firoozabadi, B. (2023). A multi-objective framework to select numerical options in air quality prediction models: A case study on dust storm modeling. Science of The Total Environment, 863, 160681. https://doi.org/10.1016/j.scitotenv.2022.160681
  9. Dun, M., Xu, Z., Chen, Y., & Wu, L. (2020). Short-term air quality prediction based on fractional grey linear regression and support vector machine. Mathematical Problems in Engineering, 2020, Article 8914501. https://doi.org/10.1155/2020/8914501
  10. Ejdari Moghaddam, M., Raeispour, K., Aziziaghli, M. A., & Ghaderizah, K. (2011). Identification of synoptic patterns of dust storm systems in southwest Iran: A case study of Khuzestan Province. The 11th Congress of Iranian Geographers, Tehran, Iran. [In Persian]
  11. Ejderi Moghadam, M., & Raeispour, K. (2011). Statistical analysis and identification of the origin of dust phenomena using meteorological codes (Case study: Khuzestan Province). The 11th Congress of Iranian Geographers, Tehran, Iran [In Persian]
  12. Ferreira, C. (2006). Gene expression programming: Mathematical modeling by an artificial intelligence. Springer.
  13. Feuerstein, S., & Schepanski, K. (2018). Identification of dust sources in a Saharan dust hot-spot and their implementation in a dust-emission model. Remote Sensing, 11(1), Article 4. https://doi.org/10.3390/rs11010004
  14. Gandomi, A. H., Roke, D. A., & Sett, K. (2013). Genetic programming for moment capacity modeling of ferrocement members. Engineering Structures57, 169-176. https://doi.org/10.1016/j.engstruct.2013.09.022
  15. Ghafariasl, P., Zeinalnezhad, M., & Ahmadishokooh, A. (2024). Optimizing PM2. 5 Forecasting Accuracy with Hybrid Meta-Heuristic and Machine Learning Models. arXiv preprint arXiv:2407.01647. https://doi.org/10.48550/arXiv.2407.01647
  16. Goudie, A. S. (2014). Desert dust and human health disorders. Environment International, 63, 101-113. https://doi.org/10.1016/j.envint.2013.10.011
  17. Hossain, J., Kadir, A. F., Hanafi, A. N., Shareef, H., Khatib, T., Baharin, K. A., & Sulaima, M. F. (2023). A review on optimal energy management in commercial buildings. Energies, 16(4), Article 1609. https://doi.org/10.3390/en16041609
  18. Jahanmiri, S., Noorian-Bidgoli, M., & Dallaki, A. (2025). Optimizing tunnel boring machine penetration rate prediction using gene expression programming and whale optimization algorithms. Modeling Earth Systems and Environment11(2), 1-22. https://doi.org/10.1007/s40808-025-02323-0
  19. Lashkari, H. and Sabuei, M. (2013). Synoptic analysis of dominant patterns in Khuzestan province’s dust storm. Scientific- Research Quarterly of Geographical Data (SEPEHR), 22(87), 32-38. https://dor.isc.ac/dor/20.1001.1.25883860.1392.22.87.5.6 [In Persian]
  20. Mehrabi, S., Soltani, S., & Jafari, R. (2015). Analyzing the relationship between dust storm occurrence and climatic parameters. Journal of Water and Soil Science, 19(71), 69–81. http://dx.doi.org/10.18869/acadpub.jstnar.19.71.69 [In Persian]
  21. Middleton, N. (2019). Variability and trends in dust storm frequency on decadal timescales: Climatic drivers and human impacts. Geosciences9(6), 261. https://doi.org/10.3390/geosciences9060261
  22. Mohammadi, G. H. (2015). Analysis of atmospheric mechanisms in dust transport from western Iran. (Doctoral dissertation, University of Tabriz). 142 pages. [In Persian]
  23. Nemati, A. M. and Kuckaki Rafsanjani, M. (2024). Review of Fuzzy and Non-Fuzzy methods based on the Whale Optimization Algorithm and their applications. Fuzzy Systems and its Applications7(2), 33-63. https://doi.org/10.22034/JFSA.2024.467966.1234 [In Persian]
  24. O’Loingsigh, T., McTainsh, G. H., Tews, E. K., Strong, C. L., Leys, J. F., Shinkfield, P., & Tapper, N. J. (2014). The Dust Storm Index (DSI): a method for monitoring broadscale wind erosion using meteorological records. Aeolian Research, 12, 29-40. https://doi.org/10.1016/j.aeolia.2013.10.004
  25. Özer, M. M. (2024). Predictive modeling of urban air pollution using machine learning and unmanned aerial vehicle platforms. In Innovative Applications of Artificial Neural Networks to Data Analytics and Signal Processing Springer, Cham (pp. 79-115). https://doi.org/10.1007/978-3-031-69769-2_5
  26. Pal, M., Maity, R., Ratnam, J. V., Nonaka, M., & Behera, S. K. (2020). Long-lead prediction of ENSO modoki index using machine learning algorithms. Scientific reports10(1), 365. https://doi.org/10.1038/s41598-019-57183-3
  27. Panicker, N. K. K., & Valarmathi, J. (2025). Time series prediction of aerosol optical depth across the northern Indian region: integrating PSO-optimized SARIMA-SVR based on MODIS data. Acta Geophysica73(2), 2097-2126. https://doi.org/10.1007/s11600-024-01472-7
  28. Piri, J., Kahkha, M. R. R., & Kisi, O. (2024). Hybrid machine learning approach integrating GMDH and SVR for heavy metal concentration prediction in dust samples. Environmental Science and Pollution Research, 1-20. https://doi.org/10.1007/s11356-024-34795-5
  29. Pourgholam Amiji, M., Ansari Ghojghar, M., Bazrafshan, J., Liaghat, A., & Araghinejad, S. (2020). Comparing the performance of SARIMA and Holt-Winters time series models with artificial intelligence methods in dust storms forecasting (Case Study: Sistan and Baluchestan Province). Physical Geography Research, 52(4), 567-587. https://doi.org/10.22059/jphgr.2021.303847.1007524 [In Persian]
  30. Rafi, N., & Rivas, P. (2024). A review on machine learning algorithms for dust aerosol detection using satellite data. Springer. https://doi.org/10.48550/arXiv.2404.09415
  31. Rahmati, O., Panahi, M., Ghiasi, S. S., Deo, R. C., Tiefenbacher, J. P., Pradhan, B., ... & Bui, D. T. (2020). Hybridized neural fuzzy ensembles for dust source modeling and prediction. Atmospheric Environment224, 117320. https://doi.org/10.1016/j.atmosenv.2020.117320
  32. Sobhani, B., Safarian Zengir, V. & Faizollahzadeh, S. (2020). Modeling and Prediction of Dust in Western Iran. Physical Geography Research52(1), 17-35. https://doi.org/10.22059/jphgr.2020.284389.1007408 [In Persian]
  33. Socha, K., & Dorigo, M. (2008). Ant colony optimization for continuous domains. European journal of operational research185(3), 1155-1173. https://doi.org/10.1016/j.ejor.2006.06.046
  34. Song, Z., Wang, C., Hou, Y., Wang, B., & Chen, W. (2024). Time series analysis of PM2.5 pollution risk based on the supply and demand of PM2.5 removal service: A case study of the urban areas of Beijing. Environmental Monitoring and Assessment, 196(7), Article 637. https://doi.org/10.1007/s10661-024-12831-8
  35. Tang, C., Sun, W., Xue, M., Zhang, X., Tang, H., & Wu, W. (2022). A hybrid whale optimization algorithm with artificial bee colony. Soft Computing, 26(5), 2075-2097. https://doi.org/10.1007/s00500-021-06623-2
  36. Turgut, M. S., Sağban, H. M., Turgut, O. E., & Özmen, Ö. T. (2021). Whale optimization and sine-cosine optimization algorithms with cellular topology for parameter identification of chaotic systems and Schottky barrier diode models. Soft Computing, 25(2), 1365-1409. https://doi.org/10.1007/s00500-020-05227-6
  37. Yarmohamadi, M., Alesheikh, A. A. and Sharif, M. (2023). A Convolutional Neural Network Model for Predicting the Transport Pathway of Dust Storms. Iranian Journal of Remote Sensing and GIS15(1), 129-139. https://doi.org/10.52547/gisj.15.1.129 [In Persian]
  38. Yong, S. L. S., Ng, J. L., Huang, Y. F., Ang, C. K., Ahmad Kamal, N., Mirzaei, M., & Najah Ahmed, A. (2024). Enhanced daily reference evapotranspiration estimation using optimized hybrid support vector regression models. Water Resources Management38(11), 4213-4241. https://doi.org/10.1007/s11269-024-03860-6
  39. Zaheer, K., Saeed, S., & Tariq, S. (2023). Prediction of aerosol optical depth over Pakistan using novel hybrid machine learning model. Acta Geophysica, 71(4), 2009-2029. https://doi.org/10.1007/s11600-023-01072-x
  40. Zeinali, M. & Pourreza-Bilondi, M. (2018). Estimation of Optimal Parameters of the Nonlinear Muskingum Model Using Continuous Ant Colony Algorithm. Irrigation and Water Engineering8(3), 94-106. [In Persian]