Development of Multidimensional Analysis System for Bio-pathways

바이오 패스웨이 다차원 분석 시스템 개발

  • 서동민 (한국과학기술정보연구원 소프트웨어연구센터 과학기술마이닝팀) ;
  • 최윤수 (한국과학기술정보연구원 소프트웨어연구센터 과학기술마이닝팀) ;
  • 전선희 (한국과학기술정보연구원 소프트웨어연구센터 과학기술마이닝팀) ;
  • 이민호 (한국과학기술정보연구원 소프트웨어연구센터 과학기술마이닝팀)
  • Received : 2014.10.16
  • Accepted : 2014.11.04
  • Published : 2014.11.28


With the development of genomics, wearable device and IT/NT, a vast amount of bio-medical data are generated recently. Also, healthcare industries based on big-data are booming and big-data technology based on bio-medical data is rising rapidly as a core technology for improving the national health and aged society. A pathway is the biological deep knowledge that represents the relations of dynamics and interaction among proteins, genes and cells by a network. A pathway is wildly being used as an important part of a bio-medical big-data analysis. However, a pathway analysis requires a lot of time and effort because a pathway is very diverse and high volume. Also, multidimensional analysis systems for various pathways are nonexistent even now. In this paper, we proposed a pathway analysis system that collects user interest pathways from KEGG pathway database that supports the most widely used pathways, constructs a network based on a hierarchy structure of pathways and analyzes the relations of dynamics and interaction among pathways by clustering and selecting core pathways from the network. Finally, to verify the superiority of our pathway analysis system, we evaluate the performance of our system in various experiments.


KEGG Pathway;Network Analysis;Big-Data;Cluster


Grant : 고성능 컴퓨팅 기반 빅데이터 기술 개발

Supported by : 한국과학기술정보연구원, 한국연구재단


  1. 서동민, 정한민, "빅데이터 분석 서비스 지원을 위한 지능형 웹 크롤러", 한국콘텐츠학회논문지, 제13권, 제12호, pp.575-584, 2013.
  2. 성원경, 이상환, 정한민, 박경석, 이승우, 김선태, 황미녕, 조민희, 과학기술 빅데이터 추진과제 발굴 및 활용 극대화를 위한 추진전략 마련 기획연구, 교육과학기술부, 2013.
  3. 윤미영, 권정은, 빅데이터로 진화하는 세상 - Big Data 글로벌 선진 사례, 한국정보화진흥원, 2012.
  6. 백인수, 박지혜, "데이터 시대: 데이터 분석의 중요성", IT&Future Strategy, 제9호, p.12, 2013.
  8. 이재권, 강태호, 이영훈, 유재수, "단백질 경로 분석 시스템의 설계 및 구현", 한국콘텐츠학회논문지, 제5권, 제6호, pp.31-40, 2005.
  9. S. J. Cho, J. W. Ryu, and J. S. Yoo, "Analysis of KEGG Flows Network Based on Protein-protein Interaction Networks," Proc. IDDIE, pp.215-216, 2011.
  10. L. P. Cordella, P. Foggia, C. Sansone, and M. Vento, "An Improved Algorithm for Matching Large Graphs," 3rd IAPR-TC15 Workshop on Graph-based Representations in Pattern Recognition, Cuen, pp.149-159, 2001.