The BIOWAY System: A Data Warehouse for Generalized Representation & Visualization of Bio-Pathways

  • Kim, Min Kyung (Department of Computer Science and Engineering, Ewha University) ;
  • Seo, Young Joo (Institute of Bioinformatics, Macrogen Inc., School of Computer Engineering, Sejong University) ;
  • Lee, Sang Ho (Department of Computer Science and Engineering, Ewha University) ;
  • Song, Eun Ha (Department of Computer Science and Engineering, Ewha University) ;
  • Lee, Ho Il (Institute of Bioinformatics, Macrogen Inc., School of Computer Engineering, Sejong University) ;
  • Ahn, Chang Shin (School of Computer Engineering, Sejong University) ;
  • Choi, Eun Chung (Department of Computer Science and Engineering, Ewha University) ;
  • Park, Hyun Seok (Department of Computer Science and Engineering, Ewha University, Institute of Bioinformatics, Macrogen Inc.)
  • Published : 2004.12.01

Abstract

Exponentially increasing biopathway data in recent years provide us with means to elucidate the large-scale modular organization of the cell. Given the existing information on metabolic and regulatory networks, inferring biopathway information through scientific reasoning or data mining of large scale array data or proteomics data get great attention. Naturally, there is a need for a user-friendly system allowing the user to combine large and diverse pathway data sets from different resources. We built a data warehouse - BIOWAY - for analyzing and visualizing biological pathways, by integrating and customizing resources. We have collected many different types of data in regards to pathway information, including metabolic pathway data from KEGG/LIGAND, signaling pathway data from BIND, and protein information data from SWISS-PROT. In addition to providing general data retrieval mechanism, a successful user interface should provide convenient visualization mechanism since biological pathway data is difficult to conceptualize without graphical representations. Still, the visual interface in the previous systems, at best, uses static images only for the specific categorized pathways. Thus, it is difficult to cope with more complex pathways. In the BIOWAY system, all the pathway data can be displayed in computer generated graphical networks, rather than manually drawn image data. Furthermore, it is designed in such a way that all the pathway maps can be expanded or shrinked, by introducing the concept of super node. A subtle graphic layout algorithm has been applied to best display the pathway data.

Keywords

References

  1. Bader, G.D., Betel, D., and Hogue, C.W. (2003). BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res. 31, 248-250 https://doi.org/10.1093/nar/gkg056
  2. Boeckmann, B., Bairoch, A., and Apweiler, R.M. (2003). The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365-370 https://doi.org/10.1093/nar/gkg095
  3. Clamp, M., Andrews, D., and Baker, D. (2003). Ensembl 2002: accommodating comparative genomics. Nucleic Acids Res. 31, 38-42 https://doi.org/10.1093/nar/gkg083
  4. Kanehisa, M., Goto, S., Kawashima, S., and Nakaya, A. (2002). The KEGG databases at GenomeNet. NucleicAcids Res. 30, 42-46 https://doi.org/10.1093/nar/30.1.42
  5. Kim, M.K. and Park, H.S. (2002). Generalized Representation of metabolic and regulatory pathways. Genome Informatics 13, 351-352
  6. Kim, M.K., Park, H.S., and Yoo, S.J. (2003). Crosstalk between metabolic and regulatory pathways. Genome informatics 14, 372-373
  7. van Helden, J., Naim, A., Lemer, C., Mancuso, R., Eldridge, M., and Wodak, S.J. (2001). From molecular activities and processes to biological function. Brief Bioinform. 2, 81-93 https://doi.org/10.1093/bib/2.1.81
  8. Wong, L. (2002). Technologies for integrating biological data. Brief Bioinform. 3, 389-404 https://doi.org/10.1093/bib/3.4.389