BIOLOGY ORIENTED TARGET SPECIFIC LITERATURE MINING FOR GPCR PATHWAY EXTRACTION

A Biology-Based, Target-Oriented Text Mining System for GPCR Pathway Extraction

  • Kim, Eun-Ju (Natural Language Processing Lab, Department of CSE, Pohang University of Science and Technology (POSTECH))
  • Jung, Seol-Kyoung (Natural Language Processing Lab, Department of CSE, Pohang University of Science and Technology (POSTECH))
  • Yi, Eun-Ji (Natural Language Processing Lab, Department of CSE, Pohang University of Science and Technology (POSTECH))
  • Lee, Gary-Geunbae (Natural Language Processing Lab, Department of CSE, Pohang University of Science and Technology (POSTECH))
  • Park, Soo-Jun (Bioinformatics Research Team, Computer and Software Research Lab, ETRI)
  • Published: 2003.10.31

Abstract

Electronically available biological literature has accumulated exponentially over time, and research on automatically acquiring knowledge from this enormous body of text through text mining has grown accordingly. However, most previous work is technology-oriented and not well focused on a practical extraction target, which results in low performance and makes the systems inconvenient for biology researchers to use in practice. In this paper, we propose a more biology-oriented, target-domain-specific text mining system, the POSTECH bio-text mining system (POSBIOTM), for signal transduction pathway extraction, in particular for G protein-coupled receptor (GPCR) pathways. To reflect more domain knowledge, we specify a concrete target for pathway extraction and define a minimal pathway domain ontology. Under this conceptual model, POSBIOTM extracts the interactions and entities of pathways from full-text biological articles using a machine-learning-oriented extraction method, and visualizes the pathways using the JDesigner module provided in the Systems Biology Workbench (SBW) [14].

Keywords