DOI QR코드

DOI QR Code

Corporate Credit Rating based on Bankruptcy Probability Using AdaBoost Algorithm-based Support Vector Machine

AdaBoost 알고리즘기반 SVM을 이용한 부실 확률분포 기반의 기업신용평가

  • Shin, Taek-Soo (Division of Business Administration, College of Government and Business, Yonsei University) ;
  • Hong, Tae-Ho (Department of Business Administration, School of Business, Pusan National University)
  • 신택수 (연세대학교 정경대학 경영학부) ;
  • 홍태호 (부산대학교 경영대학 경영학과)
  • Received : 2011.07.27
  • Accepted : 2011.08.10
  • Published : 2011.09.30

Abstract

Recently, support vector machines (SVMs) are being recognized as competitive tools as compared with other data mining techniques for solving pattern recognition or classification decision problems. Furthermore, many researches, in particular, have proved them more powerful than traditional artificial neural networks (ANNs) (Amendolia et al., 2003; Huang et al., 2004, Huang et al., 2005; Tay and Cao, 2001; Min and Lee, 2005; Shin et al., 2005; Kim, 2003).The classification decision, such as a binary or multi-class decision problem, used by any classifier, i.e. data mining techniques is so cost-sensitive particularly in financial classification problems such as the credit ratings that if the credit ratings are misclassified, a terrible economic loss for investors or financial decision makers may happen. Therefore, it is necessary to convert the outputs of the classifier into wellcalibrated posterior probabilities-based multiclass credit ratings according to the bankruptcy probabilities. However, SVMs basically do not provide such probabilities. So it required to use any method to create the probabilities (Platt, 1999; Drish, 2001). This paper applied AdaBoost algorithm-based support vector machines (SVMs) into a bankruptcy prediction as a binary classification problem for the IT companies in Korea and then performed the multi-class credit ratings of the companies by making a normal distribution shape of posterior bankruptcy probabilities from the loss functions extracted from the SVMs. Our proposed approach also showed that their methods can minimize the misclassification problems by adjusting the credit grade interval ranges on condition that each credit grade for credit loan borrowers has its own credit risk, i.e. bankruptcy probability.

최근 몇 년간 SVM(support vector machines)기법은 패턴인식 또는 분류의사결정문제를 위한 분석기법으로서 기존의 데이터마이닝 기법과 비교할 때, 매우 높은 성과를 갖는 것으로 인식되어 왔다. 더 나아나 많은 연구자들은 SVM기법이 1980년대 이후 대표적인 예측 및 분류모형으로 인정받은 인공신경망기법(ANNs : Artificial Neural Networks)에 비해 더 성과가 좋다는 사실을 실증적으로 입증해 왔다(Amendolia et al., 2003; Huang et al., 2004, Huang et al., 2005; Tay and Cao, 2001; Min and Lee, 2005; Shin et al., 2005; Kim, 2003). 일반적으로 이와 같이 다양한 데이터마이닝 기법에 의해 분석되는 이진분류 또는 다분류 의사결정문제들은 특히 금융분야 등에 있어서 오분류비용에 민감하며, 이로 인한 오분류의 경제적 손실도 상대적으로 매우 크다고 할 수 있다. 따라서 기업부도예측모형과 같은 이진분류모형의 결과값을, 부도확률에 기초하여 정교하게 계산된 사후확률의 개념으로서 다분류의 신용등급평가의 문제로 변환할 필요가 있다. 그러나, SVM 모형의 결과값은 기본적으로 그와 같은 부도확률분포를 보여주지 않는다. 따라서, 그러한 확률분포를 정교하게 보여줄 방법을 제시할 필요가 있다(Platt, 1999; Drish, 2001). 본 연구는 AdaBoost 알고리즘기반의 SVM 모형을 이용하여, 이진분류모형으로서 IT 기업의 부실예측모형에 적용한 후, 이 SVM 모형의 예측결과를 SVM의 손실함수에 적용하여 계산된 값을 사후부도확률의 정규분포 특성에 따라 이를 구간화하여 IT기업에 대한 다분류 신용등급 평가의 문제로 전환시키는 방법을 제시하였다. 그리고 본 연구에서 제안하는 방법은 이러한 AdaBoost 알고리즘기반 SVM 모형이 각 기업이 고유한 신용위험(부도확률)을 갖고 있다는 조건하에서, 신용등급부여를 위한 부도확률분포 구간을 정교하게 조정함으로써 오분류 문제를 좀 더 줄일 수 있음을 제시하였다.

Keywords

References

  1. Allwein, E. K., R. E. Schapire and Y. Singer, "Reducing multiclass to binary : a unifying approach for margin classifiers", Journal of Machine Learning Research, Vol.1(2001), 113-141.
  2. Altman, E., "Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy", The Journal of Finance, Vol.23 (1968), 589-609. https://doi.org/10.1111/j.1540-6261.1968.tb00843.x
  3. Amendolia, S. R., G. Cossu, M. L. Ganadu, B. Golosio, G. L. Masala, and G. M. Mura, "A Comparative Study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia Screening", Chemometrics and Intelligent Laboratory Systems, Vol.69(2003), 13-20. https://doi.org/10.1016/S0169-7439(03)00094-7
  4. Baran, A., J. Lakonishok, and A. R. Ofer, "The Value of General Price Level Adjusted Data to Bond Rating", Journal of Business Finance and Accounting, Vol.7, No.1(1980), 135-149. https://doi.org/10.1111/j.1468-5957.1980.tb00203.x
  5. Barniv, R., A. Agarwal, and R. Leach, "Predicting the Outcome Following Bankruptcy Filing : A Three-state Classification Using Neural Networks", Intelligent Systems in Accounting, Finance and Management, Vol.6(1997), 177-194. https://doi.org/10.1002/(SICI)1099-1174(199709)6:3<177::AID-ISAF134>3.0.CO;2-D
  6. Batchelor, R. and P. Dua, "Forecast Diversity and the Benefits of Combining Forecasts", Management Sciences, Vol.41(1995), 68-75. https://doi.org/10.1287/mnsc.41.1.68
  7. Belkaoui, A., "Industrial Bond Ratings : A New Look", Financial Management, Vol.9, No.3 (1980), 44-51. https://doi.org/10.2307/3664892
  8. Bell, T., "Neural Nets or the Logit Model? A Comparison of Each Modelʼs Ability to Predict Commercial Bank Failures", Intelligent Systems in Accounting, Finance and Management, Vol.6(1997), 249-264. https://doi.org/10.1002/(SICI)1099-1174(199709)6:3<249::AID-ISAF125>3.0.CO;2-H
  9. Boritz, J. E. and Kennedy, D. B., "Effectiveness of Neural Network Types for Prediction of Business Failure", Expert Systems with Applications, Vol.9, No.4(1995), 503-512. https://doi.org/10.1016/0957-4174(95)00020-8
  10. Breiman, L., "Arcing the edge. Tech. rep. 486, Statistics Department", University of California at Berkeley, 1997a.
  11. Breiman, L., "Prediction games and arcing classifiers, Tech, rep. 504, Statistics Department", University of California at Berkeley, 1997b.
  12. Buta, P., "Mining for Financial Knowledge with CBR", AI Expert, Vol.9, No.2(1994), 34-41.
  13. Chen, S., P. Mangiameli, and D. West, "The Comparative Ability of Self-organizing Neural Networks to Define Cluster Structure", Omega, Vol.23, No.3(1995), 271-279. https://doi.org/10.1016/0305-0483(95)00011-C
  14. Chung, H. and K. Tam, "A Comparative Analysis of Inductive Learning Algorithm", Intelligent Systems in Accounting, Finance and Management, Vol.2(1992), 3-18.
  15. Collins, M., R. E. Schapire, and Y. Singer, "Logistic regression, AdaBoost and Bregman distances", Proceedings of the Thirteenth Annual Conference on Computational Learning Theory, 2000.
  16. Drish. J., "Obtaining Calibrated Probability Estimates from Support Vector Machines", Seminar on Learning Algorithms, San Diego, Spring, 2001.
  17. Ederington, H. L., "Classification Models and Bond Ratings", Financial Review, Vol.20, No.4(1985), 237-262. https://doi.org/10.1111/j.1540-6288.1985.tb00306.x
  18. Etheridge, H. and R. Sriram, "A Comparison of the Relative Costs of Financial Distress Models : Artificial Neural Networks, Logit and Multivariate Discriminant Analysis", Intelligent Systems in Accounting, Finance and Management, Vol.6(1997), 235-248. https://doi.org/10.1002/(SICI)1099-1174(199709)6:3<235::AID-ISAF135>3.0.CO;2-N
  19. Fan, A. and M. Palaniswami, "Selecting Bankruptcy Predictors Using a Support Vector Machine Approach", in Proceedings of the International Joint Conference on Neural Networks, 2000.
  20. Fan, A. and M. Palaniswami, "A New Approach to Corporate Loan Default Prediction from Financial Statements", Proceedings of the computational finance/forecasting financial markets conference, London (CD), UK, 2000.
  21. Fletcher, D. and E. Goss, "Forecasting with Neural Networks : An Application Using Bankruptcy Data", Information and Management, Vol.24, No.3(1993), 159-167. https://doi.org/10.1016/0378-7206(93)90064-Z
  22. Freund, Y. and R. E. Schapire, "A decision theoretic generalization of online learning and an application to boosting", Journal of Computer and System Sciences, Vol.55, No.1(1997), 119-139. https://doi.org/10.1006/jcss.1997.1504
  23. Friedman, J., T. Hastie, and R. Tibshirani, "Additive logistic regression : a statistical view of boosting", The Annals of Statistics, Vol.38, No.2(2000), 337-374.
  24. Härdle, W., R. A. Moro, and D. Schäfer, "Predicting corporate bankruptcy with support vector machines", Working Slide, Humboldt University and the German Institute for Economic Research available on, 2003.
  25. Horrigan, J. O., "The Determination of Long Term Credit Standing with Financial Ratios", Journal of Accounting Research, Supplement, (1966), 44-62.
  26. Huang, W., Y. Nakamori, and S. Wang, "Forecasting Stock Market Movement Direction with Support Vector Machine", Computers and Operations Research, Vol.32(2005), 2513-2522. https://doi.org/10.1016/j.cor.2004.03.016
  27. Huang, Z., H. Chen, C. Hsu, W. Chen, and S. Wu, "Credit rating analysis with support vector machines and neural networks : a market comparative study", Decision Support Systems, Vol.37, No.4(September 2004), 543-558. https://doi.org/10.1016/S0167-9236(03)00086-1
  28. Jo, H., I. Han, and H. Lee, "Bankruptcy Prediction Using Case-based Reasoning, Neural Networks, and Discriminant Analysis, Expert Systems with Applications, Vol.13, No.2 (1997), 97-108. https://doi.org/10.1016/S0957-4174(97)00011-0
  29. Kaplan, R. S. and G. Urwitz, "Statistical Models of Bond Ratings : A Methodological Inquiry", Journal of Business, Vol.52, No.2(1979), 231-262. https://doi.org/10.1086/296045
  30. Kim, B. O. and S. M. Lee, "A Bond Rating Expert System for Industrial Companies", Expert Systems with Applications, Vol.9, No.1(1995), 63-70. https://doi.org/10.1016/0957-4174(94)00049-2
  31. Kim, K. J., "Financial Time Series Forecasting Using Support Vector Machines", Neurocomputing, Vol.55, No.1-2(2003), 307-319. https://doi.org/10.1016/S0925-2312(03)00372-2
  32. Kingdom, J. and K. Feldman, "Genetic Algorithms for Bankruptcy Prediction", Search Space Research Report, No.01-95, Search Space Ltd. London, 1995.
  33. Kwon, Y. S., I. Han, and K. C. Lee, "Ordinal Pairwise Partitioning (OPP) Approach to Neural Networks Training in Bond Rating", Intelligent Systems in Accounting, Finance and Management, Vol.6(1997), 23-40. https://doi.org/10.1002/(SICI)1099-1174(199703)6:1<23::AID-ISAF113>3.0.CO;2-4
  34. Maher, J. J. and T. K. Sen, "Predicting Bond Ratings Using Neural Networks : A Comparison with Logistic Regression", Intelligent Systems in Accounting, Finance and Management, Vol.6(1997), 59-72. https://doi.org/10.1002/(SICI)1099-1174(199703)6:1<59::AID-ISAF116>3.0.CO;2-H
  35. Markham, I. and C. Ragsdale, "Combining Neural Networks and Statistical Predictions to Solve the Classification Problem in Discriminant Analysis", Decision Sciences, Vol.26, No.2 (1995), 229-242. https://doi.org/10.1111/j.1540-5915.1995.tb01427.x
  36. Mason, L., J. Baxter, P. Bartlett, and M. Frean, "Functional gradient techniques for combining hypotheses. In Smola, A. J., Bartlett, P. J., Sch¨olkopf, B., and Schuurmans, D. (Eds.)", Advances in Large Margin Classifiers, MIT Press, 1999.
  37. Messier, W. and J. Hansen, "Inducing Rules for Expert System Development : and Example Using Default and Bankruptcy Data", Management Science, Vol.34, No.12(1988), 1403-1415. https://doi.org/10.1287/mnsc.34.12.1403
  38. Min, J. H. and Y. Lee, "Bankruptcy Prediction Using Support Vector Machine with Optimal Choice of Kernel Function Parameters", Expert Systems with Applications, Vol.28 (2005), 603-614. https://doi.org/10.1016/j.eswa.2004.12.008
  39. Odom, M. and R. Sharda, "A Neural Networks Model for Bankruptcy Prediction", Proceedings of the IEEE International Conference in Neural Networks, San Diego, CA., Vol.II (1990), 163-168.
  40. Ohlson, J., "Financial Ratios and the Probabilistic Prediction of Bankruptcy", Journal of Accounting Research, Vol.18, No.1(1980), 109-131. https://doi.org/10.2307/2490395
  41. Park, J., "Bankruptcy Prediction of Banks and Insurance Companies : An Approach Using Inductive Methods", Ph.D. Dissertation, University of Texas at Austin, 1993.
  42. Pinches, G. E. and K. A. Mingo, "A Multivariate Analysis of Industrial Bond Ratings", Journal of Finance, Vol.28, No.1(1973), 1-18. https://doi.org/10.1111/j.1540-6261.1973.tb01341.x
  43. Platt, J., "Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods", In Advances in Large Margin Classifiers, 1999.
  44. Pogue, T. F. and R. M. Soldofsky, "Whatʼs in a Bond Rating?", Journal of Financial and Quantitative Analysis, Vol.4, No.2(1969), 201-228. https://doi.org/10.2307/2329840
  45. Salchenberger, L., E. Cinar, and N. Lash, "Neural Networks : A New Tool for Predicting Thrift Failures", Decision Sciences, Vol.23(1992), 899-916. https://doi.org/10.1111/j.1540-5915.1992.tb00425.x
  46. Schapire, R. E. and Y. Singer, "Improved boosting algorithms using confidence-rated predictions", Machine Learning, Vol.37, No.3 (1999), 297-336. https://doi.org/10.1023/A:1007614523901
  47. Shin, K. S., T. S. Shin, and I. Han, "Using Induction Technique to Support Case-based Reasoning : A Case of Corporate Bond Rating", Proceedings of MS/OR Society Conference, Seoul, Korea, (1997), 199-202.
  48. Shin, K., T. S. Lee, and H. Kim, "An Application of Support Vector Machines in Bankruptcy Prediction Model", Expert Systems with Applications, Vol.28, No.1(January 2005), 127-135. https://doi.org/10.1016/j.eswa.2004.08.009
  49. Singleton, J. C. and A. J. Surkan, "Bond Rating with Neural Networks", In Refenes, A. ed., Neural Networks in the Capital Markets, John Wiley, Chichester, 1995.
  50. Sollich, P., "Bayesian Methods for Support Vector Machines : Evidence and Predictive Class Probabilities", Machine Learning, Vol.46, No.1-3(2002), 21-52. https://doi.org/10.1023/A:1012489924661
  51. Tam, K. and M. Kiang, "Managerial Applications of Neural Networks : The Case of Bank Failure Predictions", Management Science, Vol.38, No.7(1992), 926-947. https://doi.org/10.1287/mnsc.38.7.926
  52. Tay, F. E. H. and L. Cao, "Application of Support Vector Machine in Financial Time Series Forecasting", Omega, Vol.29, No.4(2001), 309-317. https://doi.org/10.1016/S0305-0483(01)00026-3
  53. Vapnik, V., The Nature of Statistical Learning Theory, New York, Springer-Verlag, 1995. West, R. R., "An Alternative Approach to Predicting Corporate Bond Ratings", Journal of Accounting Research, Vol.8, No.1(1970), 118-125. https://doi.org/10.2307/2674717
  54. Wilson, R. and R. Sharda, "Bankruptcy Prediction Using Neural Networks", Decision Support Systems, Vol.11, No.5(1994), 545-557. https://doi.org/10.1016/0167-9236(94)90024-8
  55. Zadrozny, B. and C. Elkan, "Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers", in Proceedings of the Eighteenth International Conference on Machine Learning, 2001.
  56. Zmijewski, M. E., "Methodological Issues Related to the Estimated of Financial Distress Prediction Models", Journal of Accounting Research, Vol.22, No.1(1984), 59-82. https://doi.org/10.2307/2490859

Cited by

  1. 회사채 신용등급 예측을 위한 SVM 앙상블학습 vol.18, pp.2, 2011, https://doi.org/10.13088/jiis.2012.18.2.029
  2. 경영분석지표와 의사결정나무기법을 이용한 유상증자 예측모형 개발 vol.18, pp.4, 2011, https://doi.org/10.13088/jiis.2012.18.4.059
  3. 적응형 부스팅을 이용한 파산 예측 모형: 건설업을 중심으로 vol.20, pp.1, 2011, https://doi.org/10.13088/jiis.2014.20.1.035
  4. Financial Distress Prediction Models for Wind Energy SMEs vol.10, pp.4, 2011, https://doi.org/10.5392/ijoc.2014.10.4.075
  5. 개선된 배깅 앙상블을 활용한 기업부도예측 vol.20, pp.4, 2014, https://doi.org/10.13088/jiis.2014.20.4.121
  6. A Hybrid Under-sampling Approach for Better Bankruptcy Prediction vol.21, pp.2, 2015, https://doi.org/10.13088/jiis.2015.21.2.173
  7. 재무부실화 예측을 위한 랜덤 서브스페이스 앙상블 모형의 최적화 vol.14, pp.4, 2011, https://doi.org/10.9716/kits.2015.14.4.121
  8. SVM과 meta-learning algorithm을 이용한 고지혈증 유병 예측모형 개발과 활용 vol.24, pp.2, 2018, https://doi.org/10.13088/jiis.2018.24.2.111