DOI QR코드

DOI QR Code

Hydrological Modelling of Water Level near "Hahoe Village" Based on Multi-Layer Perceptron

  • Oh, Sang-Hoon (Division of Information Communication Engineering Mokwon University) ;
  • Wakuya, Hiroshi (Graduate School of Science and Engineering Saga University)
  • Received : 2016.01.22
  • Accepted : 2016.02.16
  • Published : 2016.03.28

Abstract

"Hahoe Village" in Andong region is an UNESCO World Heritage Site. It should be protected against various disasters such as fire, flooding, earthquake, etc. Among these disasters, flooding has drastic impact on the lives and properties in a wide area. Since "Hahoe Village" is adjacent to Nakdong River, it is important to monitor the water level near the village. In this paper, we developed a hydrological modelling using multi-layer perceptron (MLP) to predict the water level of Nakdong River near "Hahoe Village". To develop the prediction model, error back-propagation (EBP) algorithm was used to train the MLP with water level data near the village and rainfall data at the upper reaches of the village. After training with data in 2012 and 2013, we verified the prediction performance of MLP with untrained data in 2014.

Keywords

1. INTRODUCTION

DPD (Disaster Prevention Design) concept has been proposed in order to realize securable society from the catastrophic disasters such as earthquake, firing, landslide, storm, and tsunami etc [1]. Especially, DPD emphasizes an alarm system for preemptive protection rather than recovery strategy of the damage occurred. It is estimated that of the total economic loss caused by all kinds of disasters, 40% are due to flooding [2]. Although flooding has been focused because of the drastic impact, however, the relationship between rainfall and flood discharge is very complex and flood prediction is still unsolved problem [2].

Based on the mathematical proofs that multi-layer perceptron (MLP) neural networks can approximate any function with enough number of hidden nodes [3]-[5], MLPs have been widely applied in various fields such as pattern recognition, speech recognition, time series prediction, hydrology etc. Furthermore, deep learning enlarges the application area to image understanding and language processing [6]. Especially, it is not necessary to elucidate complex mechanisms of phenomena to be modelled and this property expands hydrological modelling using MLPs [7]-[12].

UNESCO concluded the “Convention Concerning the Protection of the World Cultural and Natural Heritage” in 1972 for national and international protection activities of world heritages. Among many UNESCO world heritage sites in Korea, “Hahoe Village” in Andong region is adjacent to Nakdong River and, therefore, the water level near the village should be carefully monitored for protection from flooding.

In this paper, we develop a hydrological modelling using MLP to predict the water level of Nakdong River near “Hahoe Village” with rainfalls at upper reaches of the village. In section 2, we briefly introduce the architecture of MLP and its training algorithm. In section 3, the hydrological modelling of rainfalls and water level in Andong region is described and verified through simulations with real data. Finally, section 4 concludes this paper

 

2. MULTI-LAYER PERCEPTRON AND ERROR BACKPROPAGATION ALGORIHTM

Consider an MLP consisting of N inputs, H hidden nodes, and M output nodes, as shown in Fig. 1. When a sample (p = 1, 2,…, P) is presented to the input layer of MLP, by the forward propagation, the j-th hidden node is given by

Fig. 1.The architecture of a multilayer perceptron.

Here, wji denotes the weight connecting xi to hj, wj0 is a bias, and tanh(.) is the sigmoidal activation function of hidden node. The k-th output node with the sigmoidal activation function is given by

where

Also, vk0 is a bias and vkj denotes the weight connecting hj to yk .

Let the desired output vector corresponding to a training sample . As a distance measure between the actual and desired outputs, we usually use the mean-squared error (MSE) function for P training samples defined by

To minimize E, according to the negative gradient of MSE, output weights vkj ’s are iteratively updated by

where

is the error signal of output node and η is the learning rate. Also, by the backward propagation of the error signal, hidden weights wji’s are updated by

In Eq. (7), the error signal of output node is back-propagated through output weights vkj . The above weight-updating procedure for training of MLPs is the EBP (error backpropagation) algorithm [13].

We usually adopt the sigmoidal activation function of output node for classification problems, in which desired values of output nodes are in two extremely saturated regions of the sigmoid function [14]. If we want to generate “warning-no warning signal” for flood forecasting, it belongs to classification problems [2]. However, our goal is to predict the water level near “Hahoe Village”, which is a real number above zero, and we adopt a linear function as an activation function of output node. Consequently, in Eq. (2) and Eq. (6) becomes

 

3. HYDROLOGICAL MODELLING IN ANDONG REGION

Based on an MLP with a linear output node, we develop a hydrological modelling in Andong region, that is, the relationship between water level near “Hahoe Village” and rainfalls at upper reaches of the village. Fig. 2 is a map of Andong region, in which Nakdong River flows from Andong Dam located east of Andong to “Hahoe Village”. Red circles in Fig. 2 indicate the locations of water level gauge at “Gudam” and three rainfall gauges at “Pungsan”, “Iljik”, and “Andong”. Since there is not a water level gauge at “Hahoe Village” and the gauge at “Gudam” is the nearest one, we select the gauge at “Gudam” to monitor the water level near “Hahoe Village”. Also, the rainfalls at “Pungsan” and “Iljik” should be considered for the hydrological modelling, since there are tributary rivers from the two locations to Nakdong River.

Fig. 2.The map of Andong region. Red circles indicate the locations of water level gauge at “Gudam” and rainfall gauges at “Pungsan”, “Iljik”, and “Andong”

Avoiding the winter season of icing and snowing, the data at each gauge is collected from March 1st to November 30th in the year of 2012, 2013, and 2014 with the interval of one hour. So, there are 6,600 records in each year. The total data was provided by “Nakdong River Flood Control Office” and “Korea Water Resource Corp.” We use the data in 2012 and 2013 to train MLPs for the hydrological modelling and the other to test the performance of water level prediction.

In order to predict the water level at “Gudam” after D hours, we construct an MLP whose input layer consists of the water level at “Gudam” and the rainfalls at “Pungsan”, “Iljik”, and “Andong” from current (denoted by “n”) to previous n-L hours. When determining the number of parameters in MLPs, we usually adopt the method of trial and error since there is not a concrete theoretical guidance. Accordingly, we determine that L is two and the number of hidden nodes is forty through many trials and errors. Also, we adopt a single output node with linear activation function to predict the water level at “Gudam” after D hours. Therefore, the MLP architecture is 12 inputs, 40 hidden nodes with tanh(.) activation function, and one linear output node. After initializing the MLP with random weights uniformly distributed on [-1×10-4 , 1×10-4] , EBP algorithm with learning rate of 0.01 updates the weights vkj ’s and wji ’s to minimize MSE for training samples. We also estimate MSE for test samples during 10,000 iterations of MLP learning.

Firstly, we train the MLP with D=1 and plot the MSEs for training and test samples during learning in Fig. 3. After rapid reduction of MSEs in the initial stage of learning, MSEs decrease very slowly. There is a gap between MSE for training samples and that for test samples, since we never use test samples to train MLPs.

Fig. 3.MSEs(Mean-Squared Errors) for training and test samples during training of MLP to predict the water level at “Gudam” after one hour (D=1)

In order to verify the prediction performance of MLP after finish of learning, we plot the water level at “Gudam” in 2014 for whole period and its predicted value in Fig. 4(a). The horizontal axis has 6600 data points which corresponds to the whole period of 2014 from March 1st to November 30th. For easy readability of the horizontal axis, we include the index “month/day” just above the data index of horizontal axis in Fig. 4(a). Also, we plot detail curves of the real and predicted ones in the periods of March 21st to 25th and August 16th to 20th - periods with very high peaks- in Fig. 4(b) and (c), respectively. Considering that the data in 2014 is not used for training, the prediction for whole period is successful.

Fig. 4.The water level at “Gudam” in 2014 and its predicted value by the MLP after 10,000 iteration of learning with D=1. (a) March 1st ~ November 30th, (b) March 21st 20:00~25th 24:00, (c) August 16th 18:00 ~ 20th 22:00

We can estimate the statistical relationship between real data denoted by t and its predicted one y with the correlation coefficient defined by

where E[.] is the expectation operator and σt and σy are standard deviations of t and y, respectively. If the correlation coefficient is one, we can say that t and y have perfectly same tendency. Estimation of the correlation coefficient between the water level at “Gudam” and its prediction in Fig. 4(a) is 0.99692, which is a very high value.

Secondly, we train the MLP with D=2 to verify the prediction performance after two hours. Fig. 5 shows the MSEs of two-hour prediction learning, which is inferior to those in Fig. 3. Fig. 6 shows the water level at “Gudam” and its two-hour prediction value. The general prediction curve for whole period is in Fig. 6(a). And the results for detail periods of March 21st to 25th and August 16th to 20th are in Fig. 6(b) and (c), respectively. Fig. 6 shows that the predicted values are near to real values, although they are inferior to the results for D=1 in Fig. 4. Also, the correlation coefficient after 10,000 iteration training of MLP with D=2 is 0.99116, which is slightly less than 0.99692 for D=1. This is because the two-hour prediction problem is more difficult than the one-hour prediction problem. However, the prediction in the period of the highest water level (Fig. 6(c)) is near to the real water level values.

Fig. 5.MSEs during training of MLP with D=2

Fig. 6.The water level at “Gudam” in 2014 and its predicted value by the MLP after 10,000 iteration of learning with D=2. (a) March 1st ~ November 30th, (b) March 21st 20:00 ~ 25th 24:00, (c) August 16th 18:00 ~ 20th 22:00

We also conducted the D-hour prediction simulations for D=3,4,5,6 and the correlation coefficient between the water level at “Gudam” in 2014 and its predicted value is plotted in Fig. 7. The correlation coefficient degrades rapidly when increasing D. Therefore, the proposed hydrological modelling shows good prediction of the water level near “Hahoe Village” within two hours and the prediction over three hours should be improved.

Fig. 7.The correlation coefficient vs. prediction hour after 10,000 iteration training of MLPs with D=1,2,3,4,5,6

In this paper, we model the relationship between the water level near “Hahoe Village” and rainfalls at upper reaches of the village. So, we can predict the water level after D hours with given rainfalls from current to previous L hours. If there is a forecast of heavy rainfalls in Andong region, it is possible to predict the water level and whether there is flooding or not. Furthermore, the water level during heavy rainy season will be an important information for the height of bank near the village.

 

4. CONCLUSIONS

Among many UNESCO world heritages in Korea, “Hahoe Village” in Andong region is adjacent to Nakdong River and it is important to monitor the water level near the village for protection from flooding.

In this paper, we developed a hydrological modelling in Andong region using MLP, which predicted the water level near “Hahoe Village” with rainfalls at the three upper reaches of the village-“Pungsan”, “Iljik”, and “Andong”. MLP consisted of twelve inputs, forty hidden nodes, and one linear output node. The twelve inputs corresponded to three rainfalls and the water level near “Hahoe Village” from current time n to previous n-2 hours. Also, the linear output node corresponded to the water level prediction after D hours. We simulated the hydrological modelling with D=1,2,3,4,5,6. For performance verification of the water level prediction, we estimated the correlation coefficient between the water level near “Hahoe Village” and its prediction. Also, we plotted the real and predicted water levels for comparison. For D=1 and 2, the hydrological modelling was successful. However, when D was greater than two, the prediction performance degraded very much.

References

  1. H.-W. Rho, Y.-S. Oh, and K. Kitagawa, “Concepts of Disaster Prevention Design for safety in the Future Society,” Int. Journal of Contents, vol. 10, no. 1, 2014, pp. 54-61. https://doi.org/10.5392/IJoC.2014.10.1.054
  2. H. L. Cloke and F. Pappenberger, “Ensemble flood forecasting: A review,” Journal of Hydrology, vol. 375, 2009, pp. 613-626. https://doi.org/10.1016/j.jhydrol.2009.06.005
  3. K. Hornik, M. Stinchcombe, and H. White, “Multilayer Feed-forward Networks are Universal Approximators,” Neural Networks, vol.2, 1989, pp. 359-366. https://doi.org/10.1016/0893-6080(89)90020-8
  4. K. Hornik, “Approximation Capabilities of Multilayer Feedforward Networks,” Neural Networks, vol.4, 1991, pp. 251-257. https://doi.org/10.1016/0893-6080(91)90009-T
  5. S. Suzuki, “Constructive Function Approximation by Three-Layer Artificial Neural Networks,” Neural Networks, vol. 11, 1998, pp. 1049-1058. https://doi.org/10.1016/S0893-6080(98)00068-9
  6. Y. LeCun, Y. Bengio, and G. Hinton, “Deep Learning,” Nature, vol. 521, 2015, pp. 436-444. https://doi.org/10.1038/nature14539
  7. L.-H. Feng anf J. Lu, “The practical research on flood forecasting based on artificial neural networks,” Expert Systems with Applications, vol. 37, 2010, pp. 2974-2977. https://doi.org/10.1016/j.eswa.2009.09.037
  8. C. W. Dawson, R. J. Abrahart, A. Y. Shamseldin, and R. L. Wilby, “Flood estimation at ungauged sites using artificial neural networks,” Journal of Hydrology, vol. 319, 2006, pp. 391-409. https://doi.org/10.1016/j.jhydrol.2005.07.032
  9. C. W. Dawson and R. L. Wilby, “Hydrological modelling using artificial neural networks,” Progress in Physical Geography, vol. 25, 2001, pp. 80-108. https://doi.org/10.1177/030913330102500104
  10. Y. Wei, W. Xu, Y. Fan, and H.-T. Tasi, “Artificial neural network based predictive method for flood disaster,” Computers & Industrial Engineering, vol. 42, 2002, pp. 383-390. https://doi.org/10.1016/S0360-8352(02)00047-5
  11. S. Raid, J. Mania, L. Bouchaou, and Y. Najjar, “Rainfall-runoff model using an artificial neural network approach,” Mathematical and Computer Modelling, vol. 40, 2004, pp. 839-846. https://doi.org/10.1016/j.mcm.2004.10.012
  12. S. Raid, J. Mania, L. Bouchaou, and Y. Najjar, “Predicting catchment flow in a semi-arid region via an artificial neural network,” Hydrological Processes, vol. 18, 2004, pp. 2387-2393. https://doi.org/10.1002/hyp.1469
  13. D. E. Rumelhart and J. L. McClelland, Parallel Distributed Processing, Cambridge, MA, 1986.
  14. S.-H. Oh, “Improving the error back-propagation algorithm with a modified error function,” IEEE Trans. Neural Networks, vol.8, 1997, pp. 799-803. https://doi.org/10.1109/72.572117