PreSPI: Protein-Protein Interaction Prediction Service System

PreSPI: 단백질 상호작용 예측 서비스 시스템

  • 한동수 (한국정보통신대학교 공학부) ;
  • 김홍숙 (한국전자통신연구원 이동통신연구단) ;
  • 장우혁 (한국정보통신대학교 공학부) ;
  • 이성독 (한국정보통신대학교 공학부)
  • Published : 2005.12.01

Abstract

With the recognition of the importance of computational approach for protein-protein interaction prediction, many techniques have been developed to computationally predict protein-protein interactions. However, few techniques are actually implemented and announced in service form for general users to readily access and use the techniques. In this paper, we design and implement a protein interaction prediction service system based on the domain combination based protein-protein interaction prediction technique, which is known to show superior accuracy to other conventional computational protein-protein interaction prediction methods. In the prediction accuracy test of the method, high sensitivity($77\%$) and specificity($95\%$) are achieved for test protein pairs containing common domains with teaming sets of proteins in a Yeast. The stability of the method is also manifested through the testing over DIP CORE, HMS-PCI, and TAP data. Performance, openness and flexibility are the major design goals and they are achieved by adopting parallel execution techniques, web Services standards, and layered architecture respectively. In this paper, several representative user interfaces of the system are also introduced with comprehensive usage guides.

References

  1. M. Deng, S. Metah, F. Sun and T. Chen, Inferring Domain-Domain Interactions from Protein-Protein Interactions. Genome Research, 12, 1540-1548, 2002 https://doi.org/10.1101/gr.153002
  2. A. J. Enright, I. Iliopoulos, N. C. Kyrpides and C. A. Ouzounis, Protein interaction maps for complete genomes based on gene fusion events. Nature, 402, 86-90, 1999 https://doi.org/10.1038/47056
  3. E. M. Marcotte, M. Pellegrini, H. L. Ng, D. W. Rice, T. O. Yeates and D. Eisenberg, Detecting protein function and protein-protein interactions from genome sequences. Science, 285, 751-753, 1999 https://doi.org/10.1126/science.285.5428.751
  4. S. Ng, Z. Zhang and S. Tan, Integrative approach for computationally inferring protein domain interactions. Bioinformatics, 19, 923-929, 2003 https://doi.org/10.1093/bioinformatics/btg118
  5. A. J. Enright and C. A. Ouzounis, Chapter 33: Protein-Protein Interactions-A Molecular Cloning Manual, Cold Spring Harbor Laboratory Press, Cold spring Harbor, NY, 2002
  6. J. R. Bock and D. A. Gough, Prediction of protein-protein interaction from primary structure, Bioinformatics, 17, 455-460, 2001 https://doi.org/10.1093/bioinformatics/17.5.455
  7. J. Wojcik and V. Schachter, Protein-Protein interaction map inference using interacting domain profile pairs. Bioinformatics, 17 Suppl., S296-S305, 2001 https://doi.org/10.1093/bioinformatics/17.suppl_1.S296
  8. D. S. Han, H. S. Kim, J. M. Seo, and W. H. Jang, A Domain Combination Based Probabilistic Framework for Protein-Protein Interaction Prediction, Genome Informatics, No. 14, 250-259, 2003
  9. D. S. Han, H. S. Kim, W. H. Jang, and S. D. Lee, Domain Combination Based Protein-Protein Interaction Possibility Ranking Method, Proc. of 4th IEEE Sym. on Bioinfo. and Bioeng., 434-441, May, 2004
  10. 한동수, 서정민, 김홍숙, 장우혁, 도메인 조합 기반 단백질-단백질 상호작용 확률 예측 틀, 정보과학회논문지:컴퓨팅의 실제, Vol. 10, No. 4, 299-308, August, 2004
  11. W3C Web Services Architecture Working Group, Web Services Architecture, World Wide Web Consortium, http://www.w3.org/TR/ws-arch/, Feb., 2004
  12. W3C Web Services Architecture Working Group, Web Services Architecture Requirements, World Wide Web Consortium, http://www.w3.org/TR/wsa-reqs/, Feb., 2004
  13. W3C Web Services Architecture Working Group, Web Services Architecture Usage Scenarios, World Wide Web Consortium, http://www.w3.org/TR/ws-arch-scenarios/, Feb., 2004
  14. R. Apweiler, et al., The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res., 29, 37-40, 2001 https://doi.org/10.1093/nar/29.1.37
  15. H. M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T. N. Bhat, H. Weissig, I. N. Shindyalov and P. E. Bourne, The Protein Data Bank. Nucleic Acids Res., 28, 235-242, 2000 https://doi.org/10.1093/nar/28.1.235
  16. C.M. Deane, L. Salwinski, I. Xenarios, and D. Eisenberg, Protein Interactions: Two methods for assessment of the reliability of high throughput observations, Mol. Cell. Proteomics, 1, 349-356, 2002 https://doi.org/10.1074/mcp.M100037-MCP200
  17. Y. Ho, et al. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature, 415, 180-183, 2002 https://doi.org/10.1038/415180a
  18. A. Gavin, et al., Functional organization of the yeast proteom by systematic analysis of protein complexes, Nature., 415, 141-147, 2002 https://doi.org/10.1038/415141a
  19. Ng, S.K., Zhang, Z., Tan, S.H., and Lin, K., 'InterDom: a database of putative interacting protein domains for validating predicted protein interactions and complexes,' Nucleic Acids Research, Vol. 31, No. 1, pp. 251-254, 2003 https://doi.org/10.1093/nar/gkg079
  20. I. Xenarios, E. Fernandez, L. Salwinski, X. J. Duan, M. J. Thompson, E. M. Marcotte and D. Eisenberg, DIP: The Database of Inter acting Proteins: 2001 update. Nucleic Acids Res., 29, 239-241, 2001 https://doi.org/10.1093/nar/29.1.239