Early Software Quality Prediction Using Support Vector Machine

Support Vector Machine을 이용한 초기 소프트웨어 품질 예측

  • Received : 2011.03.19
  • Accepted : 2011.06.11
  • Published : 2011.06.30

Abstract

Early criticality prediction models, which determine whether a design entity is fault-prone, are becoming increasingly important as software development projects grow larger. Effective prediction can reduce system development cost and improve software quality by identifying trouble spots in early phases and allocating effort and resources appropriately. Many prediction models have been proposed using statistical and machine learning methods. This paper builds a prediction model using the Support Vector Machine (SVM), one of the most popular modern classification methods, and compares its prediction performance with that of a well-known prediction model, the BackPropagation neural network Model (BPM). SVM is known to generalize well even in high-dimensional spaces when training data are scarce. In the performance evaluation experiments, no dimensionality reduction is applied to the data set because the dimension of the input data is already small. Experimental results show that the SVM model predicts slightly better than BPM, and that the polynomial kernel function achieves better performance than the other SVM kernel functions.
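As a rough illustration of what choosing an SVM kernel function involves, the sketch below defines three common kernels and the kernelized decision function f(x) = sum_i c_i K(s_i, x) + b, whose sign would label a design entity as fault-prone or not. It is a minimal sketch, not the paper's implementation: the function names, the parameter values (degree, coef0, gamma), and the toy support vectors and coefficients are all assumptions made for illustration, and no real metric data from the study is used.

```python
import math


def linear_kernel(x, z):
    """Plain dot product of two feature vectors."""
    return sum(a * b for a, b in zip(x, z))


def polynomial_kernel(x, z, degree=2, coef0=1.0, gamma=1.0):
    """Polynomial kernel (gamma * <x, z> + coef0) ** degree.
    The default parameter values are illustrative assumptions."""
    return (gamma * linear_kernel(x, z) + coef0) ** degree


def rbf_kernel(x, z, gamma=0.5):
    """Gaussian (RBF) kernel exp(-gamma * ||x - z||^2).
    The default gamma is an illustrative assumption."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * sq_dist)


def svm_decision(x, support_vectors, dual_coefs, bias, kernel):
    """Kernelized SVM decision value: sum_i c_i * K(s_i, x) + bias,
    where c_i is the product of a multiplier and the class label (+1/-1)."""
    return sum(c * kernel(s, x) for s, c in zip(support_vectors, dual_coefs)) + bias


def predict_fault_prone(x, support_vectors, dual_coefs, bias, kernel):
    """Hypothetical helper: a positive decision value means 'fault-prone'."""
    return svm_decision(x, support_vectors, dual_coefs, bias, kernel) > 0


# Toy, hand-picked values (not trained, not from the paper).
toy_support_vectors = [[1.0, 0.0], [0.0, 1.0]]
toy_dual_coefs = [1.0, -1.0]
toy_bias = 0.0
toy_entity = [2.0, 1.0]  # e.g. two design metrics of one entity

toy_score = svm_decision(toy_entity, toy_support_vectors, toy_dual_coefs,
                         toy_bias, linear_kernel)
```

In a trained model the support vectors, coefficients, and bias come from solving the SVM optimization problem on labeled design metrics; swapping the `kernel` argument is what changes the shape of the decision boundary.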
