Prediction for Periodontal Disease using Gene Expression Profile Data based on Machine Learning

Prediction of Periodontal Disease Using Gene Expression Data Based on Machine Learning

  • Rhee, Je-Keun (Department of Life Science in Dentistry, School of Dentistry, Pusan National University)
  • Received : 2019.07.15
  • Accepted : 2019.07.30
  • Published : 2019.08.31

Abstract

Periodontal disease is common among adults. However, its molecular mechanisms and how to treat it at the molecular level are not yet clearly understood. Here, we investigated the molecular differences between periodontal disease and normal control tissues using gene expression data. In particular, we tested whether machine learning algorithms can classify periodontal disease and normal tissues from gene expression data. We also identified differentially expressed genes and examined their functions. In a t-SNE analysis, the periodontal disease and normal control samples formed clearly separated clusters. In addition, several classification algorithms (decision trees, random forests, and support vector machines) classified the two groups with high accuracy, sensitivity, and specificity, even though the dataset was imbalanced. Finally, we found that genes related to inflammation and immune response tended to show distinct expression patterns between the two classes.

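Periodontal disease affects a large proportion of adults, yet much remains unknown about its mechanism of onset and its treatment at the molecular level. In this study, we use gene expression data obtained from periodontal disease tissue and normal tissue to determine whether the two differ at the molecular level. In particular, we use machine learning algorithms to test whether periodontal disease tissue and normal tissue can be classified on the basis of gene expression levels, and we examine which functions are mainly carried out by the genes whose expression differs between the tissues. Analysis with t-SNE confirmed that normal and periodontal disease tissue samples cluster into clearly separated groups. Furthermore, classification with decision trees, random forests, and support vector machines achieved high accuracy, sensitivity, and specificity despite the imbalanced data, and the genes that differed between the two groups were mainly those related to inflammatory and immune responses.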
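The abstract reports accuracy, sensitivity, and specificity together because the dataset is imbalanced. The sketch below shows why that matters: on imbalanced labels, accuracy alone can look respectable for a classifier that never predicts the minority class. It is a minimal illustration, not the study's code. The function name `binary_metrics` and the toy labels (8 disease samples, 2 controls) are assumptions made for this example and do not come from the paper's data.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity and specificity for 0/1 labels (1 = disease)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one sample is required")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # disease, predicted disease
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # control, predicted control
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # control, predicted disease
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # disease, predicted control

    return {
        "accuracy": (tp + tn) / len(pairs),
        # Sensitivity: fraction of disease samples that were detected.
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # Specificity: fraction of control samples that were recognised.
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


# Hypothetical imbalanced toy labels: 8 disease samples followed by 2 controls.
y_true = [1] * 8 + [0] * 2

# A trivial classifier that labels every sample as disease.
always_disease = [1] * 10
# A classifier that misses one disease sample but recognises both controls.
better = [1] * 7 + [0] + [0, 0]

trivial_scores = binary_metrics(y_true, always_disease)
better_scores = binary_metrics(y_true, better)

print(trivial_scores)
print(better_scores)
```

In this toy case the trivial classifier reaches 80% accuracy with a specificity of zero, which is why a claim of good classification on imbalanced data needs all three measures.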
