DOI QR코드

DOI QR Code

A Design of SOA-based Data Integration Framework for Effective Spatial Data Mining

효과적인 공간 데이터 마이닝을 위한 SOA 기반 데이터 통합 프레임워크 설계

  • 문일환 (한경대학교 컴퓨터공학과) ;
  • 허환 (경기동부과수농협) ;
  • 김삼근 (한경대학교 컴퓨터공학과)
  • Received : 2011.05.02
  • Accepted : 2011.08.03
  • Published : 2011.10.31

Abstract

Recently, the concern of IT-in-Agriculture convergence technology that combines information technology and agriculture is increasing rapidly. Especially, the crop cultivation related prediction services by spatial data mining (SDM) can play an important role in reducing the damage of natural disaster and enhancing crop productivity. However, the data conversion and integration procedure to acquire the learning dataset of SDM for the prediction service need a lot of effort and time, because of their heterogeneity between distributed data. In addition, calculating spatial neighborhood relationships between spatial and non-spatial data necessitates requires the complicated calculation procedure for large dataset. In this paper, we suggest a SOA-based data integration framework that can effectively integrate distributed heterogeneous data by treating each data source as a service unit and support to find the optimal prediction service by improving productivity of learning dataset for SDM. In our experiment, we confirmed that our framework can be effectively applied to find the optimal prediction service for the frost damage area, by considering the case of peach crop cultivation in Icheon in Korea.

최근 농업 분야에 IT를 접목시킨 농업-IT 융합 기술에 대한 연구가 주목 받고 있다. 특히, 공간 데이터 마이닝(spatial data mining, SDM)을 이용한 농작물 관련 예측 서비스들을 통해 자연재해에 대한 피해를 줄이고 농작물의 생산성을 높이고자 하는 연구들이 있어 왔다. 그러나 예측 서비스를 위한 SDM에 필요한 학습 데이터는 분산되어 있는 데이터간의 이질성으로 인해 데이터 변환과 통합과정에 많은 비용과 시간이 발생한다. 또한 공간 데이터와 비공간 데이터 간의 공간적 이웃 관계를 연산하기 위해 대용량의 데이터에 대한 복잡한 연산과정이 필요하다. 본 논문에서는 각각의 데이터 소스를 하나의 서비스 단위로 취급함으로써 분산된 이질적인 데이터를 효과적으로 통합 관리할 수 있고 SDM을 위한 학습 데이터의 생산성을 향상시켜 최적의 예측 서비스의 발견을 지원해 주는 SOA 기반의 데이터 통합 프레임워크를 제안한다. 실험을 통해 경기도 이천시의 복숭아나무의 동해 피해지역에 대한 최적의 예측 서비스의 발견을 위해 제안 프레임워크를 효과적으로 적용할 수 있음을 확인하였다.

Keywords

References

  1. Reddy, P. K. and Ankaiah, R., "A framework of information technology‐based agriculture information dissemination system to improve crop productivity," Current science, Vol.88, No.12, pp.1905-1913, 2005.
  2. Fraisse, C.W., Breuer, N.E., Zierden, D., Bellow, J.G., Paz, J., Cabrera, V.E., Garcia y Garcia, A., Ingram, K.T., Hatch, U., Hoogenboom, G., Jones, J.W., "AgClimate: A climate forecast information system for agricultural risk management in the southeastern USA," Computers and electronics in agriculture, Vol.53, No.1, pp.13-27, 2006. https://doi.org/10.1016/j.compag.2006.03.002
  3. Han, J. and Micheline, K., "Data Mining: Concepts and Techniques," Morgan Kaufmann Publishers, 2001.
  4. Ester, M., Kriegel, H.‐P., and J. Sander, "Algorithms and Applications for Spatial Data Mining," In H. J. Miller and J. Han, editors, Geographic Data Mining and Knowledge Discovery, 2001.
  5. Scheibler, T., Mietzner, R. and Leymann, F., "EAI as a Service - Combining the Power of Executable EAI Patterns and SaaS", Enterprise Distributed Object Computing Conference, pp.107-116, 2008.
  6. Huamin, W. and Zhiwei, Y., "An ETL Services Framework Based on Metadata", Intelligent Systems and Applications (ISA), 2nd International Workshop on, pp.1-4, 2010.
  7. Thomas, E., "Service‐Oriented Architecture: Concepts, Technology, and Design," Prentice Hall, PTR, 2005.
  8. Roy S., "The New Integration Scenario: Five Trends That Change How application Software Work," Gartner Application Integration and Web Service Summit, 2005.
  9. Xu, H., Hongqi, L., Qiaoyan, D. Zhuang, W., "The SOA‐Based Solution for Distributed Enterprise Application Integration," Computer Science‐Technology and Applications, International Forum, Vol.3, pp.330-336, 2009.
  10. Sha, Z. and Xie, Y., "Design of service‐oriented architecture for spatial data integration and its application in building web ‐based GIS systems," Geo‐Spatial Information Science, Vol.13, No.1, pp.8-15, 2010. https://doi.org/10.1007/s11806-010-0163-7
  11. Haitao D., Bo Z. and Dingfang C., "Design and Actualization of SOA‐based Data Mining System," Computer‐Aided Industrial Design and Conceptual Design, 9th International Conference, pp.22-25, 2008.
  12. Awad M.M.I. and Abdullah M.S., "A framework for interoperable distributed ETL components based on SOA," Software Technology and Engineering(ICSTE), 2nd international conference, pp.67-70, 2010.
  13. Han S. and Kim J., "Rough set‐based decision‐tree using a core attribute," Int. J. Inf. Technol. Decisi. Mak., Vol.7, No.2, pp.275-290, 2008. https://doi.org/10.1142/S0219622008002946
  14. Hu Y. and Tseng F., "Mining simplified fuzzy if‐then rules for pattern classification," Int. J. Inf. Technol. Decisi. Mak., Vol.8, No.3, pp.473-489, 2009. https://doi.org/10.1142/S021962200900348X
  15. Peng Y., Kou G., Shi Y. and Chen Z., "A descriptive framework for the field of data mining and knowledge discovery," Int. J. Inf. Technol. Decisi. Mak., Vol.7, No.4, pp.639-682, 2008. https://doi.org/10.1142/S0219622008003204
  16. Witten, I. H., Frank, E. and Hall, M. A., "Data Mining: Practical Machine Learning Tools and Techniques," 3rd Ed., Morgan Kaufmann Publishers, 2011.
  17. L. Aijun, L. Yunhui and L. Siwei, Mapping a decision-tree for classification into a neural network, Proc. 7th Int. Conf. on Computational Intelligence & Natural Computing, pp.1528-1531, 2003.
  18. M. Kim, H. Na, K. Chae, H. Bang and J. Na, A Combined Data Mining Approach for DDoS Attack Detection, Lecture Notes in Computer Science (LNCS) Vol.3090, pp.943-950, 2004. https://doi.org/10.1007/978-3-540-25978-7_95

Cited by

  1. Freeze Risk Assessment for Three Major Peach Growing Areas under the Future Climate Projected by RCP8.5 Emission Scenario vol.14, pp.3, 2012, https://doi.org/10.5532/KJAFM.2012.14.3.124