A research on the key factors for classification of diabetes based on random forest

  • Shin, Yong sub (Graduate School of Smart Convergence, Kwangwoon University)
  • Lee, Namju (Department of Physical Education, Institute of Information Technology, Kwangwoon University)
  • Hwang, Chigon (Department of Computer Engineering, Institute of Information Technology, Kwangwoon University)
  • Received : 2020.06.06
  • Accepted : 2020.06.16
  • Published : 2020.08.31

Abstract

Recently, the number of people visiting hospitals because of diabetes has been increasing. According to the Korean Diabetes Association, 1 in 7 adults over the age of 30 has diabetes, which makes it one of the most common diseases today. In this paper, in addition to blood sugar, which is widely used to identify diabetes, we examine BMI, which is known to be related to diabetes, and triglycerides and cholesterol, which cause various complications in diabetics. These factors were studied using random forests and decision trees, two techniques known to be effective for classification. The importance of each factor was assessed from the classification results and the feature importances derived from the two techniques. In this way, we studied how BMI, triglycerides, and cholesterol relate to diabetes, alongside blood sugar, a factor that diabetic patients should watch closely.
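
The comparison described in the abstract can be illustrated with a minimal sketch. It assumes scikit-learn and NumPy, neither of which is named in the abstract, and it uses synthetic data: the dataset, labeling rule, column names, and hyperparameters below are all hypothetical and are not the authors'. It fits a decision tree and a random forest on the four factors and reads each model's impurity-based feature importances.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Hypothetical column names for the four factors named in the abstract.
FEATURES = ["blood_sugar", "bmi", "triglyceride", "cholesterol"]


def make_synthetic_data(n=1000, seed=0):
    """Generate made-up records; this is NOT the paper's dataset."""
    rng = np.random.default_rng(seed)
    X = np.column_stack([
        rng.normal(105, 25, n),  # fasting blood sugar (mg/dL), assumed range
        rng.normal(24, 4, n),    # BMI (kg/m^2), assumed range
        rng.normal(140, 60, n),  # triglyceride (mg/dL), assumed range
        rng.normal(195, 35, n),  # total cholesterol (mg/dL), assumed range
    ])
    # Assumed labeling rule: blood sugar dominates, BMI and triglyceride
    # contribute weakly, cholesterol not at all, plus random noise.
    score = (0.08 * (X[:, 0] - 105)
             + 0.10 * (X[:, 1] - 24)
             + 0.005 * (X[:, 2] - 140)
             + rng.normal(0, 1, n))
    y = (score > 1.0).astype(int)
    return X, y


def fit_and_rank(X, y, seed=0):
    """Fit both classifiers and collect test accuracy and feature importances."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=seed, stratify=y)
    models = {
        "decision_tree": DecisionTreeClassifier(max_depth=4, random_state=seed),
        "random_forest": RandomForestClassifier(n_estimators=200, random_state=seed),
    }
    results = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        results[name] = {
            "accuracy": model.score(X_test, y_test),
            # Impurity-based importances; they sum to 1 across features.
            "importance": dict(zip(FEATURES, model.feature_importances_)),
        }
    return results


results = fit_and_rank(*make_synthetic_data())
for name, res in results.items():
    ranked = sorted(res["importance"].items(), key=lambda kv: kv[1], reverse=True)
    print(name, "accuracy=%.3f" % res["accuracy"])
    for feature, value in ranked:
        print("  %-13s %.3f" % (feature, value))
```

Because the labels here are generated from an assumed rule, the resulting ranking only reflects that rule; it shows how the two importance rankings can be compared, not what the paper found.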
