DOI QR코드

DOI QR Code

Comparing the Performance of 17 Machine Learning Models in Predicting Human Population Growth of Countries

  • Otoom, Mohammad Mahmood (Department of Computer Science and Information, College of Science in Zulfi Majmaah University)
  • Received : 2021.01.05
  • Published : 2021.01.30

Abstract

Human population growth rate is an important parameter for real-world planning. Common approaches rely upon fixed parameters like human population, mortality rate, fertility rate, which is collected historically to determine the region's population growth rate. Literature does not provide a solution for areas with no historical knowledge. In such areas, machine learning can solve the problem, but a multitude of machine learning algorithm makes it difficult to determine the best approach. Further, the missing feature is a common real-world problem. Thus, it is essential to compare and select the machine learning techniques which provide the best and most robust in the presence of missing features. This study compares 17 machine learning techniques (base learners and ensemble learners) performance in predicting the human population growth rate of the country. Among the 17 machine learning techniques, random forest outperformed all the other techniques both in predictive performance and robustness towards missing features. Thus, the study successfully demonstrates and compares machine learning techniques to predict the human population growth rate in settings where historical data and feature information is not available. Further, the study provides the best machine learning algorithm for performing population growth rate prediction.

Keywords

Acknowledgement

The author would like to thank the Deanship of Scientific Research at Majmaah University for supporting this work under project number No. R-2021-20

References

  1. K. W. Kintigh and M. A. Peeples, "Estimating Population Growth Rates and Instantaneous Population from Periodized Settlement Data," Journal of Computer Applications in Archaeology, vol. 3, 2020. https://doi.org/10.5334/jcaa.60
  2. C. Rich, "The measurement ofthe rate of population growth," Journal of the Institute of actuaries», voi. LXV, parte I, 1934.
  3. A. N. Rose and N. N. Nagle, "Validation of spatiodemographic estimates produced through data fusion of small area census records and household microdata," Computers, Environment and Urban Systems, vol. 63, pp. 38-49, 2017. https://doi.org/10.1016/j.compenvurbsys.2016.07.006
  4. R. Wilson, E. zu Erbach-Schoenberg, M. Albert, D. Power, S. Tudge, M. Gonzalez, et al., "Rapid and near real-time assessments of population displacement using mobile phone data following disasters: the 2015 Nepal Earthquake," PLoS currents, vol. 8, 2016.
  5. P. Deville, C. Linard, S. Martin, M. Gilbert, F. R. Stevens, A. E. Gaughan, et al., "Dynamic population mapping using mobile phone data," Proceedings of the National Academy of Sciences, vol. 111, pp. 15888-15893, 2014. https://doi.org/10.1073/pnas.1408439111
  6. W. Anderson, S. Guikema, B. Zaitchik, and W. Pan, "Methods for estimating population density in data-limited areas: Evaluating regression and tree-based models in peru," PLoS one, vol. 9, p. e100037, 2014. https://doi.org/10.1371/journal.pone.0100037
  7. G. Bandyopadhyay and S. Chattopadhyay, "An Artificial Neural Net approach to forecast the population of India," arXiv preprint nlin/0607058, 2006.
  8. O. Folorunso, A. Akinwale, O. Asiribo, and T. Adeyemo, "Population prediction using artificial neural network," African Journal of Mathematics and Computer Science Research, vol. 3, pp. 155-162, 2010.
  9. N. Ahmed, N. A. Diptu, M. S. K. Shadhin, M. A. F. Jaki, M. F. Hasan, M. N. Islam, et al., "Artificial Neural Network and Machine Learning Based Methods for Population Estimation of Rohingya Refugees: Comparing Data-Driven and Satellite Image-Driven Approaches," Vietnam Journal of Computer Science, vol. 6, pp. 439-455, 2019. https://doi.org/10.1142/S2196888819500246
  10. V. Riiman, A. Wilson, R. Milewicz, and P. Pirkelbauer, "Comparing Artificial Neural Network and Cohort Component Models for Population Forecasts," Population Review, vol. 58, 2019.
  11. M. M. Otoom, M. Jemmali, Y. Qawqzeh, K. N. SA, and F. Al Fay, "Comparative Analysis of Different Machine Learning Models for Estimating the Population Growth Rate in Data-Limited Area," IJCSNS, vol. 19, p. 96, 2019.
  12. N. Cesare, C. Grant, Q. Nguyen, H. Lee, and E. O. Nsoesie, "How well can machine learning predict demographics of social media users?," arXiv preprint arXiv:1702.01807, 2017.
  13. United Nations, World Population Prospects 2017. 2017.
  14. G. Nargund, "Declining birth rate in Developed Countries: A radical policy re-think is required," Facts, views & vision in ObGyn, vol. 1, p. 191, 2009.
  15. J. Bongaarts, "Human population growth and the demographic transition," Philosophical Transactions of the Royal Society B: Biological Sciences, vol. 364, pp. 2985-2990, 2009. https://doi.org/10.1098/rstb.2009.0137
  16. R. Boucekkine, D. De la Croix, and D. Peeters, "Early literacy achievements, population density, and the transition to modern growth," Journal of the European Economic Association, vol. 5, pp. 183-226, 2007. https://doi.org/10.1162/JEEA.2007.5.1.183
  17. D. Ahlburg and R. Cassen, "Population and development," 2008.
  18. D. Steinley, "K-means clustering: a half-century synthesis," British Journal of Mathematical and Statistical Psychology, vol. 59, pp. 1-34, 2006. https://doi.org/10.1348/000711005X48266
  19. F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, et al., "Scikit-learn: Machine learning in Python," the Journal of machine Learning research, vol. 12, pp. 2825-2830, 2011.