DOI QR코드

DOI QR Code

A Study on Polynomial Neural Networks for Stabilized Deep Networks Structure

안정화된 딥 네트워크 구조를 위한 다항식 신경회로망의 연구

  • Jeon, Pil-Han (School of Electrical and Electronic Engineering, the University of Suwon) ;
  • Kim, Eun-Hu (School of Electrical and Electronic Engineering, the University of Suwon) ;
  • Oh, Sung-Kwun (School of Electrical and Electronic Engineering, the University of Suwon)
  • Received : 2017.01.19
  • Accepted : 2017.09.26
  • Published : 2017.12.01

Abstract

In this study, the design methodology for alleviating the overfitting problem of Polynomial Neural Networks(PNN) is realized with the aid of two kinds techniques such as L2 regularization and Sum of Squared Coefficients (SSC). The PNN is widely used as a kind of mathematical modeling methods such as the identification of linear system by input/output data and the regression analysis modeling method for prediction problem. PNN is an algorithm that obtains preferred network structure by generating consecutive layers as well as nodes by using a multivariate polynomial subexpression. It has much fewer nodes and more flexible adaptability than existing neural network algorithms. However, such algorithms lead to overfitting problems due to noise sensitivity as well as excessive trainning while generation of successive network layers. To alleviate such overfitting problem and also effectively design its ensuing deep network structure, two techniques are introduced. That is we use the two techniques of both SSC(Sum of Squared Coefficients) and $L_2$ regularization for consecutive generation of each layer's nodes as well as each layer in order to construct the deep PNN structure. The technique of $L_2$ regularization is used for the minimum coefficient estimation by adding penalty term to cost function. $L_2$ regularization is a kind of representative methods of reducing the influence of noise by flattening the solution space and also lessening coefficient size. The technique for the SSC is implemented for the minimization of Sum of Squared Coefficients of polynomial instead of using the square of errors. In the sequel, the overfitting problem of the deep PNN structure is stabilized by the proposed method. This study leads to the possibility of deep network structure design as well as big data processing and also the superiority of the network performance through experiments is shown.

Keywords

References

  1. V. Rouss, W. Charon, and G. Cirrincione, "Neural model of the dynamic behaviour of a non-linear mechanical system," Mechanical Systems and Signal Processing, Vol 23, pp. 1145-1159, 2009. https://doi.org/10.1016/j.ymssp.2008.09.004
  2. A. G. Ivahnenko, "Polynomial theory of complex systems," IEEE Trans. on Systems, Man and Cybernetics, Vol. SMC-1, pp. 364-378, 1971. https://doi.org/10.1109/TSMC.1971.4308320
  3. S. K. Oh and W. Pedrycz, "The Design of Self-Organizing Polynomial Neural Networks,"Information Sciences, Vol. 141, pp. 237-258, 2002. https://doi.org/10.1016/S0020-0255(02)00175-5
  4. S. K. Oh, T. C. Ahn, and W. Pedrycz, "Fuzzy Polynomial Neural Networks-Based Structure and Its Application to Nonlinear Process Systems," 7th IFSA World Conference, Vol. 2, pp. 495-499, 1997.
  5. Ho-Sung Park, Ki-Sang Kim, Sung-Kwun Oh "Design of Particle Swarm Optimization-based Polynomial Neural Networks"THE TRANSACTION OF THE KOREAN INSTITUTE OF ELECTRICAL ENGINEERS 60(2), 2011.2, 398-406 https://doi.org/10.5370/KIEE.2011.60.2.398
  6. AE Hoerl, RW Kennard. "Ridge regression: Biased estimation for nonorthogonal problems", Technometrics, Vol. 12, No. 1, p. 55-67, 1970 https://doi.org/10.1080/00401706.1970.10488634
  7. A. N. Tikhonov, and V. Y. "Arsenin, solution of ill-posed problems.", Washington: Winston & Sons, 1977
  8. Q. Fan, J,. M. Zurada, W. Wu, "Convergence of online gradient method for feedforward neural networks with smoothing L1/L2 regylarization penalty", Neurocomputing, Vol 131, pp. 208-216, 2014 https://doi.org/10.1016/j.neucom.2013.10.023
  9. Y. H. Pyo, K. H. Lee, K. H. You, "Earthquake magnitude estimation using Recursive Least Squares", IEEE International Conference on Cloud Computing and Big Data, pp. 394-397, 2016
  10. H. S. Park, Y, H. Jin, S. K Oh, "Evolutionary Design of Radial Basis Function-based Polynomial Neural Network with the aid of Information Granulation", The Tr.nsactions of Korean Institute of Electrical Engineers, Vol. 60, No. 4, pp. 862-870, 2011. https://doi.org/10.5370/KIEE.2011.60.4.862
  11. S. K. Oh, W. D. Kim, H. S. Park, M. H. Son, "Identification Methodology of FCM-based Fuzzy Model Using Particle Swarm Optimization", The Transactions of Korean Institute of Electrical Engineers, Vol. 60, No. 1, pp. 184-192, 2011. https://doi.org/10.5370/KIEE.2011.60.1.184
  12. L. P. Maguire, B. Roche, T. M. McGinnity, L. J. McDaid, "Predicting a chaotic time series using a fuzzy neural network", Information Science, Vol. 112, pp. 125-136, 1998. https://doi.org/10.1016/S0020-0255(98)10026-9
  13. S. K. Oh, Y. H. Kim, H. S. Park, J. T. Kim, "Design of Data-centroid Radial Basis Function Neural Network with Extended Polynomial Type and Its Optimization", The Transactions of Korean Institute of Electrical Engineers, Vol. 60, No. 3, pp. 639-647, 2011. https://doi.org/10.5370/KIEE.2011.60.3.639