The Prediction of DEA based Efficiency Rating for Venture Business Using Multi-class SVM

다분류 SVM을 이용한 DEA기반 벤처기업 효율성등급 예측모형

  • Park, Ji-Young (College of Business Administration, Pusan National University) ;
  • Hong, Tae-Ho (College of Business Administration, Pusan National University)
  • Published : 2009.06.30

Abstract

For the last few decades, many studies have tried to explore and unveil venture companies' success factors and unique features in order to identify the sources of such companies' competitive advantages over their rivals. Such venture companies have shown tendency to give high returns for investors generally making the best use of information technology. For this reason, many venture companies are keen on attracting avid investors' attention. Investors generally make their investment decisions by carefully examining the evaluation criteria of the alternatives. To them, credit rating information provided by international rating agencies, such as Standard and Poor's, Moody's and Fitch is crucial source as to such pivotal concerns as companies stability, growth, and risk status. But these types of information are generated only for the companies issuing corporate bonds, not venture companies. Therefore, this study proposes a method for evaluating venture businesses by presenting our recent empirical results using financial data of Korean venture companies listed on KOSDAQ in Korea exchange. In addition, this paper used multi-class SVM for the prediction of DEA-based efficiency rating for venture businesses, which was derived from our proposed method. Our approach sheds light on ways to locate efficient companies generating high level of profits. Above all, in determining effective ways to evaluate a venture firm's efficiency, it is important to understand the major contributing factors of such efficiency. Therefore, this paper is constructed on the basis of following two ideas to classify which companies are more efficient venture companies: i) making DEA based multi-class rating for sample companies and ii) developing multi-class SVM-based efficiency prediction model for classifying all companies. First, the Data Envelopment Analysis(DEA) is a non-parametric multiple input-output efficiency technique that measures the relative efficiency of decision making units(DMUs) using a linear programming based model. It is non-parametric because it requires no assumption on the shape or parameters of the underlying production function. DEA has been already widely applied for evaluating the relative efficiency of DMUs. Recently, a number of DEA based studies have evaluated the efficiency of various types of companies, such as internet companies and venture companies. It has been also applied to corporate credit ratings. In this study we utilized DEA for sorting venture companies by efficiency based ratings. The Support Vector Machine(SVM), on the other hand, is a popular technique for solving data classification problems. In this paper, we employed SVM to classify the efficiency ratings in IT venture companies according to the results of DEA. The SVM method was first developed by Vapnik (1995). As one of many machine learning techniques, SVM is based on a statistical theory. Thus far, the method has shown good performances especially in generalizing capacity in classification tasks, resulting in numerous applications in many areas of business, SVM is basically the algorithm that finds the maximum margin hyperplane, which is the maximum separation between classes. According to this method, support vectors are the closest to the maximum margin hyperplane. If it is impossible to classify, we can use the kernel function. In the case of nonlinear class boundaries, we can transform the inputs into a high-dimensional feature space, This is the original input space and is mapped into a high-dimensional dot-product space. Many studies applied SVM to the prediction of bankruptcy, the forecast a financial time series, and the problem of estimating credit rating, In this study we employed SVM for developing data mining-based efficiency prediction model. We used the Gaussian radial function as a kernel function of SVM. In multi-class SVM, we adopted one-against-one approach between binary classification method and two all-together methods, proposed by Weston and Watkins(1999) and Crammer and Singer(2000), respectively. In this research, we used corporate information of 154 companies listed on KOSDAQ market in Korea exchange. We obtained companies' financial information of 2005 from the KIS(Korea Information Service, Inc.). Using this data, we made multi-class rating with DEA efficiency and built multi-class prediction model based data mining. Among three manners of multi-classification, the hit ratio of the Weston and Watkins method is the best in the test data set. In multi classification problems as efficiency ratings of venture business, it is very useful for investors to know the class with errors, one class difference, when it is difficult to find out the accurate class in the actual market. So we presented accuracy results within 1-class errors, and the Weston and Watkins method showed 85.7% accuracy in our test samples. We conclude that the DEA based multi-class approach in venture business generates more information than the binary classification problem, notwithstanding its efficiency level. We believe this model can help investors in decision making as it provides a reliably tool to evaluate venture companies in the financial domain. For the future research, we perceive the need to enhance such areas as the variable selection process, the parameter selection of kernel function, the generalization, and the sample size of multi-class.

Keywords

References

  1. Ahn, H., Kim, K.-j, and Han, I., 'Intelligent Credit Rating Model for Korean Companies using Multiclass Support Vector Machines,' Korean Management Review, Vol. 35, No. 5, 2006, pp. 1479-1496
  2. Banker, R.D., Charnes A., and Cooper, W.W., 'Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis,' Management Science, Vol. 30, No. 9, 1984, pp. 1078-1092 https://doi.org/10.1287/mnsc.30.9.1078
  3. Barua, P.L., Brockette, P.L., Cooper, W.W., Deng, H., Parket, B.R. Ruefli, T.W., and Winston, A., 'DEA evaluations of long-and short-run efficiencies of digital vs. physical product 'dot com' companies,' Socio Economic Planning Sciences, Vol. 38, No. 4, 2004, pp. 233-253 https://doi.org/10.1016/j.seps.2003.10.002
  4. Carlos, S.C., Yolanda, F.C., and Cecilio, M.M., 'Measuring DEA efficiency in Internet companies,' Decision Support Systems, Vol. 38, No. 4, 2005, pp. 557-573 https://doi.org/10.1016/j.dss.2003.08.004
  5. Charnes, W.W., Cooper, and Rhodes, E., 'Measuring the Efficiency of Decision Making Units,' European Journal of Operation Research, Vol. 2, No. 6, 1978, pp. 429-444 https://doi.org/10.1016/0377-2217(78)90138-8
  6. Crammer, K. and Singer, Y., 'On the learnability and design of output codes for multiclass problems,' Comput. Learning Theory, 2000, pp. 35-46
  7. Farrell, M.J., 'The Measurement of Productive Efficiency,' Journal of the Royal Statistical Society, Vol. 120, 1957, pp. 253-281 https://doi.org/10.2307/2343100
  8. Garavaglia, S., 'An application of a Counter Propagation Neural Networks: Simulating the Standard and Poor's Corporate Bond Rating Systems,' Proceedings of the First International Conference on Artificial Intelligence on Wall Street, 1991, pp. 278-287
  9. Hearst, M.A., Dumais, S.T., Osman, E., Platt, j.. and Scholkopf, B., 'Support vector machines,' IEEE lntelligent System, Vol. 13, No. 4, 1998, pp. 18-28 https://doi.org/10.1109/5254.708428
  10. Hong, H.K., Shin, C.K., Park, S.C., and Kim. S.H., 'Evaluating thc efficiency of system íntegration projects using data envelopment analysis(DEA) and machine learning,' Expert Systems with Applications, Vol. 16, 1999, pp. 283-296 https://doi.org/10.1016/S0957-4174(98)00077-3
  11. Hong, T. and Park, J., 'A DEA-Based Data Mining for the Evaluation of the Efficiency in the IT Venture Business,' Proceedings of the 13th Asia Pacfic Management Conference, 2007
  12. Hong, T., Park, J., and Kim, E., 'Using DEA-logit Approach to Evaluate Efficiency of IT Venture Business,' The Journal of Internet Electronic Commerce Research, Vol. 7, No. 3, 2007, pp. 429-449
  13. Hsu, C.W. and Lin, C.J., 'A Comparison of Methods for Multiclass Support Vector Machines,' IEEE Transactions on Neural Networks, Vol. 13, No. 2, 2002, pp. 415-425 https://doi.org/10.1109/72.991427
  14. Hu, M.Y., Zhang, G., Christine, X.J., and Patuwo, B.E., 'A cross-validation analysis of neural network out-of-sample performance in exchange rate forecasting,' Decision Sciences, Vol. 30, No. 1, 1999, pp. 197-216 https://doi.org/10.1111/j.1540-5915.1999.tb01606.x
  15. Huang, Z., Chen, H., Hsu, C-J., Chen, W-H., and Wu, S., 'Credit rating analysis with support vector machines and neural network: a market comparative study,' Decision Support Systems, Vol. 37, 2004, pp. 543-558 https://doi.org/10.1016/S0167-9236(03)00086-1
  16. Hwang, W.W. and Yang, B.S., 'Fault Diagnosis of Rotating Macrunery Using Multiclass Support Vector Machines,' Transactions of the Korean Society for Noise and Vibration Engineering, Vol. 14, No. 12, 2004, pp. 1233-1240 https://doi.org/10.5050/KSNVN.2004.14.12.1233
  17. Kim, J.J., 'Research Papers: The Effect of DEA Efficiency on ROE of KOSDAQ Firms,' TAX ACCOUNTING RESEARCH, Vol. 14, 2004, pp. 153-170
  18. Kim, J.W., 'Expert systems for bond rating: a comparative analysis of statistical, rule -based and neural network systems,' Expert Systems, Vol. 10, 1993, pp. 167-171 https://doi.org/10.1111/j.1468-0394.1993.tb00093.x
  19. Kim K.J., 'Productivity Analysis of Software Industry in Korea,' Productivify Review, Vol. 19, No. 1, 2005, pp. 95-111
  20. Kim, K.S., 'Analyzing the Technical Efficiency of Korean System Integration Firms Using DEA and Malmquist Productivity Analysis,' Journal of the Korea society of IT services, Vol. 5, No. 1, 2006, pp. 1-16
  21. Koo, C.O., Kim, G.J., and Park, J.W., 'The Productivity Analysis of KOSDAQ Software Companies,' Journal Finance and Accounting Accountiong Information, Vol. 6, No. 1, 2006, pp. 27-51
  22. Kwon, Y.S., Han, I.G., and Lee, K.C., 'Ordinal Pairwise Partitioning(OPP) approach to neural networks training in bond rating,' Intelligent Systems in Accounting, Finance and Management, Vol. 6, 1997, pp. 23-40 https://doi.org/10.1002/(SICI)1099-1174(199703)6:1<23::AID-ISAF113>3.0.CO;2-4
  23. Lee, K.-J., Kim, J.-J., and Geon, C., 'The Evaluation of the Efficiencyof Internet Companies Using a DEA Model,' DAEHAN Association of Business Administration, Vol. 20, No. 1, 2007, pp. 109-136
  24. Lee, Y.-C., 'Credit Risk Evaluation Using Worst Practice DEA,' DAEHAN Association of Business Administration, Vol. 19, No. 5, 2006, pp. 183-203
  25. Min, J.H. and Jeong, C.W., 'Nonparametric Approach to Bankruptcy Prediction: Developíng Cross Peeling Technique integrating DEA and Negative DEA,' Korean Management Review, Vol. 35, No. 4, 2006, pp. 1157-1180
  26. Nguyen, M.H. and Rajapakse, J.C., 'MultiClass Support Vector Machines for Protein Secondary Structure Prediction,' Genome Informatics, Vol. 14, 2003, pp. 218-227
  27. Paradi, J.C, Asmild, M., and Simark, P.C., 'Using DEA and worst practice DEA in credit risk evaluation,' Journal of Productive Analysis, Vol. 21, No. 2, 2004, pp. 153-165 https://doi.org/10.1023/B:PROD.0000016870.47060.0b
  28. Platt, J.C., Cristianini, N., and Shawe-Taylor, J., 'Large Margin DAGs for Multiclass Classification,' in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, Vol. 12, 2000, pp. 547-553
  29. Shin, K.S. and Han, I., 'A case-based approach using inductive indexing for corporate bond rating,' Decision Support Systems, Vol. 32, No. 1, 2001, pp. 41-52 https://doi.org/10.1016/S0167-9236(01)00099-9
  30. Song, K.M., Bahn, S.S., and Kim, S.T., 'The Significance of Financial Model in Venture Credit Rating and Its Comparison with the Case of Large Corporate,' The Korean Venture Management Review, Vol. 10, No. 2, 2007, pp. 1-23
  31. Tay, F.E.H. and Cao, L.J., 'Application of support vector machines in financial time series forecasting,' Omega, Vol. 29, No. 4, 2001, pp. 309-317 https://doi.org/10.1016/S0305-0483(01)00026-3
  32. Vapnik, V., Statistical, tistical Learning Theory, New York, Wiley, 1998
  33. Weston, J. and Watkins, C., 'Multi-class support vector machines,' presented at the Proc. ESAMM99, M. Verleysen, Ed., Brussels, Belgium, 1999
  34. Zhu, J., 'Multi-factor performance measure model with an application to Fortune 500 companies,' European Journal of Operational Research, Vol. 123, No. 1, 2000, pp. 105-124 https://doi.org/10.1016/S0377-2217(99)00096-X
  35. Zhu, J., 'Multi-factor performance measure model with an application to Fortune 500 companies,' European Journal of Operational Research, Vol. 123, No. 1, pp. 105-124 https://doi.org/10.1016/S0377-2217(99)00096-X
  36. Zhu, J., Quantitative Model For Performance Evaluation And Benchmarking, Kluwer Academic Publishers, 2003