DOI QR코드

DOI QR Code

Optimization of Random Subspace Ensemble for Bankruptcy Prediction

재무부실화 예측을 위한 랜덤 서브스페이스 앙상블 모형의 최적화

  • Received : 2015.04.27
  • Accepted : 2015.12.09
  • Published : 2015.12.31

Abstract

Ensemble classification is to utilize multiple classifiers instead of using a single classifier. Recently ensemble classifiers have attracted much attention in data mining community. Ensemble learning techniques has been proved to be very useful for improving the prediction accuracy. Bagging, boosting and random subspace are the most popular ensemble methods. In random subspace, each base classifier is trained on a randomly chosen feature subspace of the original feature space. The outputs of different base classifiers are aggregated together usually by a simple majority vote. In this study, we applied the random subspace method to the bankruptcy problem. Moreover, we proposed a method for optimizing the random subspace ensemble. The genetic algorithm was used to optimize classifier subset of random subspace ensemble for bankruptcy prediction. This paper applied the proposed genetic algorithm based random subspace ensemble model to the bankruptcy prediction problem using a real data set and compared it with other models. Experimental results showed the proposed model outperformed the other models.

Keywords

References

  1. Abellan, J. and C.J. Mantas, "Improving Experimental Studies about Ensembles of Classifiers for Bankruptcy Prediction and Credit Scoring", Expert Systems with Applications, Vol.41, No.8, 2014, 3825-3830. https://doi.org/10.1016/j.eswa.2013.12.003
  2. Altman, E.L., "Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy", The Journal of Finance, Vol. 23, No.4, 1968, 589-609. https://doi.org/10.1111/j.1540-6261.1968.tb00843.x
  3. Beaver, W., "Financial Ratios as Predictors of Failure, Empirical Research in Accounting : Selected Studied", Journal of Accounting Research, Vol.4, No.3, 1966, 71-111. https://doi.org/10.2307/2490171
  4. Bian, S. and W. Wang, "On Diversity and Accuracy of Homogeneous and Heterogeneous Ensembles", International Journal of Hybrid Intelligent Systems, Vol.4, No.2, 2007, 103-128. https://doi.org/10.3233/HIS-2007-4204
  5. Breiman, L., "Bagging Predictors", Machine Learning, Vol.24, No.2, 1996, 123-140. https://doi.org/10.1023/A:1018054314350
  6. Buta, P., "Mining for Financial Knowledge with CBR", AI Expert, Vol.9, No.10, 1994, 34-41.
  7. Dietterich, T.G., "Machine-Learning Research : Four Current Directions", AI Magazine, Vol.18, No.4, 1997, 97-136.
  8. Freund, Y. and R. Schapire, "Experiments with a New Boosting Algorithm", Proceedings of the 13th International Conference on Machine learning, 1996, 148-156.
  9. Ho, T., "The Random Subspace Method for Construction Decision Forests", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.20, No.8, 1998, 832-844. https://doi.org/10.1109/34.709601
  10. Ho, T., "Multiple Classifier Combination : Lessons and Next Steps", Hybrid Methods in Pattern Recognition, Kandel and Bunke, Eds. Singapore : World Scientific, 2002, 171-198.
  11. Kim, J.I. and E.J. Lee, "A Technique to Apply Inlining for Code Obfuscation Based on Genetic Algorithm", Journal of Information Technology Services, Vol.10, No.3, 2011, 167-177.(김정일, 이은주, "유전 알고리즘에 기반한 코드 난독화를 위한 인라인 적용 기법", 한국IT서비스학회지, 제10권, 제3호, 2011, 167-177.)
  12. Kim, M.J., "A Performance Comparison of Ensemble in Bankruptcy Prediction", Entrue Journal of Information Technology, Vol.8, No.2, 2009, 41-49.(김명종, "기업부실화 예측에 대한 앙상블 학습의 성과 비교", 엔트루 저널, 제8권, 제2호, 2009, 41-49.)
  13. Kim, M.J., "Optimal Selection of Classifier Ensemble Using Genetic Algorithms", Journal of Intelligence and Information Systems, Vol.16, No.4, 2010, 99-112.(김명종, "유전자 알고리즘을 이용한 분류자 앙상블의 최적 선택", 지능정보연구, 제16권, 제4호, 2010, 99-112.)
  14. Kim, M.J., D.K. Kang, and H.B. Kim, "Geometric Mean Based Boosting Algorithm with Over-Sampling to Resolve Data Imbalance Problem for Bankruptcy Prediction", Expert Systems with Applications, Vol.42, No.3, 2015, 1074-1082. https://doi.org/10.1016/j.eswa.2014.08.025
  15. Kim, S.H. and J.W. Kim, "SOHO Bankruptcy Prediction Using Modified Bagging Predictors", Journal of Intelligence and Information Systems, Vol.13, No.2, 2007, 15-26.(김승혁, 김종우, "Modified Bagging Predictors를 이용한 SOHO 부도 예측", 한국지능정보시스템학회논문지, 제13권, 제2호, 2007, 15-26.)
  16. Kuncheva, L.I. and C.J. Whitaker, "Measures of Diversity in Classifier Ensembles and their Relationship with the Ensemble Accuracy", Machine Learning, Vol.51, No.2, 2003, 181-207. https://doi.org/10.1023/A:1022859003006
  17. Kuncheva, L.I., "Combining Pattern Classifiers : Methods and Algorithms", Wiley-Interscience, 2004.
  18. Lee, E.J., Y.R. Seo, S.Y. Yoon, H.R. Jang, and H.K. Bahn, "Power Scheduling of Smart Buildings in the Smart Grid Environment Using IT Optimization Techniques", Journal of Information Technology Services, Vol.11, 2012, 41-50.(이은지, 서유리, 윤소영, 장혜린, 반효경, "IT 최적화 기술을 이용한 지능형전력망 환경의 스마트 빌딩 스케줄링", 한국IT서비스학회지, 제11권, 특집호, 2012, 41-50.)
  19. Li, H., Y.C. Lee, Y.C. Zhou, and J. Sun, "The Random Subspace Binary Logit(RSBL) Model for Bankruptcy Prediction", Knowled ge-Based Systems, Vol.24, No.8, 2011, 1380-1388. https://doi.org/10.1016/j.knosys.2011.06.015
  20. Louzada, F., O. Anacleto-Junior, C. Candolo, and J. Mazucheli, "Poly-Bagging Predictors for Classification Modelling for Credit Scoring", Expert Systems with Applications, Vol.38, No.10, 2011, 12717-12720. https://doi.org/10.1016/j.eswa.2011.04.059
  21. Meyer, P.A. and H. Pifer, "Prediction of Bank Failures", The Journal of Finance, Vol.25, No.4, 1970, 853-868. https://doi.org/10.1111/j.1540-6261.1970.tb00558.x
  22. Min, S.H., "Bankruptcy Prediction Using an Improved Bagging Ensemble", Journal of Intelligence and Information Systems, Vol.20, No.4, 2014, 121-139.(민성환, "개선된 배깅 앙상블을 활용한 기업부도 예측", 지능정보연구, 제20권, 제4호, 2014, 121-139.) https://doi.org/10.13088/JIIS.2014.20.4.121
  23. Ohlson, J., "Financial Ratios and the Probabilistic Prediction of Bankruptcy", Journal of Accounting Research, Vol.18, No.1, 1980, 109-131. https://doi.org/10.2307/2490395
  24. Shin, T. and T. Hong, "Corporate Credit Rating Based on Bankruptcy Probability Using AdaBoost Algorithm-Based Support Vector Machine", Journal of Intelligence and Information Systems, Vol.17, No.3, 2011, 25-41.
  25. Yule, G.U., "On the association of attributes in statistics : with illustrations from the material of the childhood society, and Philosophical Transactions of the Royal Society of London", Series A, Containing Papers of a Mathematical or Physical Character, 1900, 257-319.
  26. Zhang, G., Y.M. Hu, E.B. Patuwo, and C.D. Indro, "Artificial Neural Networks in Bankruptcy Prediction : General Framework and Cross-Validation Analysis", European Journal of Operational Research, Vol.116, No.1, 1999, 16-32. https://doi.org/10.1016/S0377-2217(98)00051-4