DOI QR코드

DOI QR Code

The Distributed Encryption Processing System for Large Capacity Personal Information based on MapReduce

맵리듀스 기반 대용량 개인정보 분산 암호화 처리 시스템

  • Received : 2013.10.24
  • Accepted : 2013.12.04
  • Published : 2014.03.31

Abstract

Collecting and utilizing have a huge amount of personal data have caused severe security issues such as leakage of personal information. Several encryption algorithms for collected personal information have been widely adopted to prevent such problems. In this paper, a novel algorithm based on MapReduce is proposed for encrypting such private information. Furthermore, test environment has been built for the performance verification of the distributed encryption processing method. As the result of the test, average time efficiency has improved to 15.3% compare to encryption processing of token server and 3.13% compare to parallel processing.

대량의 개인정보가 수집되어 활용됨에 따라 개인정보 유출 등의 보안 문제가 발생하고 있다. 이에 최근에는 수집된 개인정보를 암호화 하여 저장하고 활용하는 방법이 사용되고 있다. 본 논문에서는 기존에 수집된 대량의 개인정보를 단시간에 암호화하기 위한 방법으로 맵리듀스 기반의 분산 암호화 처리 방법을 제안하고, 시스템을 설계하고 구현하였다. 또한 맵리듀스 기반의 분산 암호화 처리 방법의 성능을 검증을 위해 테스트 환경을 구축하여 비교 실험을 진행하였다. 실험 결과 토큰 서버의 암호화 처리 시간이 순차처리 대비 평균 시간 효율이 약 15.3% 정도 향상 하였으며, 병렬처리대비 약 3.13%정도 향상되는 것을 확인 하였다.

Keywords

References

  1. Y. J. Song, K. Y. Park, H. J. Kim, J. M. Do, and D. H. Lee, "A study on the secret sharing scheme for managing a large quantity of data including individual information," Dongguk University, KISA : CA, Report KISA-RP-2009-0013, Sep. 2009.
  2. J. W. Kang, "A Study of Effective Privacy Protection System on High Concurrent Transaction Database System", Convergence Security Journal, vol. 12, no. 2, pp.107-113, May. 2012.
  3. J. H. Hong, I. R. Jeong, "A case study on the performance of encrypted data", Journal of the Korea Institute of Information Security and Cryptology, vol. 22, no. 6, pp.1301-1313, Dec. 2012.
  4. Ken Mann, m. Tim Jones. (2008, December). Distributed computing with Linux and Hadoop [Internet]. Available: http://www.ibm.com/developerworks/linux/library/l-hadoop
  5. T. H. Kum, "Design and Implementation of A Monitor for Hadoop Cluster", M.S. dissertation, Computer Engineering, Hanyang University, Feb.2011.
  6. D. Borthaku. (2013, April). HDFS Architecture Guide. The Apache Software Foundation [Online]. Available: http://hadoop.apache.org/docs/stable/hdfs_ design.html
  7. J. H. Kwak, J. W. Yoon, Y. H. Jung, J. g. Hahm, D. I. Park, "Large-scale Data Analysis based on Hadoop for Astroinformatics", Journal of KIISE : Computing Practices and Letters, vol. 17, no. 11, pp.587-591, Nov. 2011.
  8. J. W. Lee, S.K. Kim, "Complementary research and Analysis for hadoop" in The Korea Society of Computer and Information Winter Conference 2012, vol. 20, no. 2, pp.3- 6, July. 2012
  9. E. M. Park, "A Study on Database Encryption Scheme for Privacy Protection under Packaged Software Environment", M.S. dissertation, Hanyang Cyber University, Feb. 2013.
  10. J. K. Heo, "Web Application Security using Distributed Encipherment", The Korea Contents Association Journal, vol. 8, no. 4, pp.10-16, Apr, 2008. https://doi.org/10.5392/JKCA.2008.8.4.010

Cited by

  1. A Study on the Improving Performance of Massively Small File Using the Reuse JVM in MapReduce vol.18, pp.9, 2015, https://doi.org/10.9717/kmms.2015.18.9.1098
  2. A Method of Distributed Parallel Processing based on Multi-Server for Improving Encryption Performance vol.19, pp.3, 2015, https://doi.org/10.6109/jkiice.2015.19.3.529
  3. 암호화와 감사 로깅에서 보안 요건 정의 연구 vol.19, pp.9, 2014, https://doi.org/10.9708/jksci.2014.19.9.085
  4. 분산병렬처리 환경에서 오토매핑 기법을 통한 NoSQL과 RDBMS와의 연동 vol.21, pp.11, 2014, https://doi.org/10.6109/jkiice.2017.21.11.2067