DOI QR코드

DOI QR Code

Network Graph Analysis of Gene-Gene Interactions in Genome-Wide Association Study Data

  • Lee, Sungyoung (Interdisciplinary Program in Bioinformatics, Seoul National University) ;
  • Kwon, Min-Seok (Interdisciplinary Program in Bioinformatics, Seoul National University) ;
  • Park, Taesung (Interdisciplinary Program in Bioinformatics, Seoul National University)
  • Received : 2012.10.31
  • Accepted : 2012.11.16
  • Published : 2012.12.31

Abstract

Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs). For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR) is one of the powerful and efficient methods for detecting high-order gene-gene ($G{\times}G$) interactions. However, the biological interpretation of $G{\times}G$ interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified $G{\times}G$ interactions. The proposed network graph analysis consists of three steps. The first step is for performing $G{\times}G$ interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified $G{\times}G$ interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE) data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform $G{\times}G$ interaction analysis of body mass index (BMI). Our network graph analysis successfully showed that many identified $G{\times}G$ interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of $G{\times}G$ interactions.

Keywords

References

  1. Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev Genet 2005;6: 95-108.
  2. Cho YS, Go MJ, Kim YJ, Heo JY, Oh JH, Ban HJ, et al. A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat Genet 2009;41:527-534. https://doi.org/10.1038/ng.357
  3. Weedon MN, Lango H, Lindgren CM, Wallace C, Evans DM, Mangino M, et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat Genet 2008;40: 575-583. https://doi.org/10.1038/ng.121
  4. Voight BF, Scott LJ, Steinthorsdottir V, Morris AP, Dina C, Welch RP, et al. Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis. Nat Genet 2010; 42:579-589. https://doi.org/10.1038/ng.609
  5. Newton-Cheh C, Johnson T, Gateva V, Tobin MD, Bochud M, Coin L, et al. Genome-wide association study identifies eight loci associated with blood pressure. Nat Genet 2009;41:666-676. https://doi.org/10.1038/ng.361
  6. Hill JO, Peters JC. Environmental contributions to the obesity epidemic. Science 1998;280:1371-1374. https://doi.org/10.1126/science.280.5368.1371
  7. Ichihara S, Yamada Y. Genetic factors for human obesity. Cell Mol Life Sci 2008;65:1086-1098. https://doi.org/10.1007/s00018-007-7453-8
  8. Hofker M, Wijmenga C. A supersized list of obesity genes. Nat Genet 2009;41:139-140. https://doi.org/10.1038/ng0209-139
  9. Feitosa MF, Borecki IB, Rich SS, Arnett DK, Sholinsky P, Myers RH, et al. Quantitative-trait loci influencing body-mass index reside on chromosomes 7 and 13: the National Heart, Lung, and Blood Institute Family Heart Study. Am J Hum Genet 2002;70:72-82. https://doi.org/10.1086/338144
  10. Farooqi IS, O'Rahilly S. Genetic factors in human obesity. Obes Rev 2007;8 Suppl 1:37-40.
  11. Awaya T, Yokosaki Y, Yamane K, Usui H, Kohno N, Eboshida A. Gene-environment association of an ITGB2 sequence variant with obesity in ethnic Japanese. Obesity (Silver Spring) 2008;16:1463-1466. https://doi.org/10.1038/oby.2008.68
  12. Frayling TM, Timpson NJ, Weedon MN, Zeggini E, Freathy RM, Lindgren CM, et al. A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity. Science 2007;316:889-894. https://doi.org/10.1126/science.1141634
  13. Scuteri A, Sanna S, Chen WM, Uda M, Albai G, Strait J, et al. Genome-wide association scan shows genetic variants in the FTO gene are associated with obesity-related traits. PLoS Genet 2007;3:e115. https://doi.org/10.1371/journal.pgen.0030115
  14. Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, Parl FF, et al. Multifactor-dimensionality reduction reveals highorder interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet 2001;69:138-147. https://doi.org/10.1086/321276
  15. Julia A, Moore J, Miquel L, Alegre C, Barceló P, Ritchie M, et al. Identification of a two-loci epistatic interaction associated with susceptibility to rheumatoid arthritis through reverse engineering and multifactor dimensionality reduction. Genomics 2007;90:6-13. https://doi.org/10.1016/j.ygeno.2007.03.011
  16. Brassat D, Motsinger AA, Caillier SJ, Erlich HA, Walker K, Steiner LL, et al. Multifactor dimensionality reduction reveals gene-gene interactions associated with multiple sclerosis susceptibility in African Americans. Genes Immun 2006;7:310-315. https://doi.org/10.1038/sj.gene.6364299
  17. Lou XY, Chen GB, Yan L, Ma JZ, Mangold JE, Zhu J, et al. A combinatorial approach to detecting gene-gene and gene-environment interactions in family studies. Am J Hum Genet 2008;83:457-467. https://doi.org/10.1016/j.ajhg.2008.09.001
  18. Lee SY, Oh SH, Kwon MS, Lee SY, Park TS. Two-way interaction analysis of obesity trait from Korean population using generalized MDR. In: IEEE International Conference on Bioinformatics and Biomedicine Workshops (Di Bernardo D, Li GZ, Chan TF, Luo B, Chen J, Michalowski M, eds.), 2010 Dec 18-21, Hong Kong, pp. 353-358.
  19. Kwon MS, Kim K, Lee S, Park T. cuGWAM: Genome-wide association multifactor dimensionality reduction using CUDAenabled high-performance graphics processing unit. Int J Data Min Bioinform 2012;6:471-481. https://doi.org/10.1504/IJDMB.2012.049301
  20. Rabbee N, Speed TP. A genotype calling algorithm for affymetrix SNP arrays. Bioinformatics 2006;22:7-12. https://doi.org/10.1093/bioinformatics/bti741
  21. Yu W, Gwinn M, Clyne M, Yesupriya A, Khoury MJ. A navigator for human genome epidemiology. Nat Genet 2008;40: 124-125. https://doi.org/10.1038/ng0208-124
  22. Kozomara A, Griffiths-Jones S. miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res 2011; 39:D152-D157. https://doi.org/10.1093/nar/gkq1027
  23. Macintyre G, Bailey J, Haviv I, Kowalczyk A. is-rSNP: a novel technique for in silico regulatory SNP detection. Bioinformatics 2010;26:i524-i530. https://doi.org/10.1093/bioinformatics/btq378
  24. Obayashi T, Kinoshita K. COXPRESdb: a database to compare gene coexpression in seven model animals. Nucleic Acids Res 2011;39:D1016-D1022. https://doi.org/10.1093/nar/gkq1147
  25. Velez DR, White BC, Motsinger AA, Bush WS, Ritchie MD, Williams SM, et al. A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction. Genet Epidemiol 2007;31:306-315. https://doi.org/10.1002/gepi.20211
  26. Bordicchia M, Battistoni I, Mancinelli L, Giannini E, Refi G, Minardi D, et al. Cannabinoid CB1 receptor expression in relation to visceral adipose depots, endocannabinoid levels, microvascular damage, and the presence of the Cnr1 A3813G variant in humans. Metabolism 2010;59:734-741. https://doi.org/10.1016/j.metabol.2009.09.018
  27. Vogel CI, Greene B, Scherag A, Muller TD, Friedel S, Grallert H, et al. Non-replication of an association of CTNNBL1 polymorphisms and obesity in a population of Central European ancestry. BMC Med Genet 2009;10:14.
  28. Heard-Costa NL, Zillikens MC, Monda KL, Johansson A, Harris TB, Fu M, et al. NRXN3 is a novel locus for waist circumference: a genome-wide association study from the CHARGE Consortium. PLoS Genet 2009;5:e1000539. https://doi.org/10.1371/journal.pgen.1000539
  29. Liu YJ, Guo YF, Zhang LS, Pei YF, Yu N, Yu P, et al. Biological pathway-based genome-wide association analysis identified the vasoactive intestinal peptide (VIP) pathway important for obesity. Obesity (Silver Spring) 2010;18:2339-2346. https://doi.org/10.1038/oby.2010.83
  30. Huang da W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009;37:1-13. https://doi.org/10.1093/nar/gkn923
  31. Buchner DA, Yazbek SN, Solinas P, Burrage LC, Morgan MG, Hoppel CL, et al. Increased mitochondrial oxidative phosphorylation in the liver is associated with obesity and insulin resistance. Obesity (Silver Spring) 2011;19:917-924. https://doi.org/10.1038/oby.2010.214
  32. Neuman RJ, Wasson J, Atzmon G, Wainstein J, Yerushalmi Y, Cohen J, et al. Gene-gene interactions lead to higher risk for development of type 2 diabetes in an Ashkenazi Jewish population. PLoS One 2010;5:e9903. https://doi.org/10.1371/journal.pone.0009903
  33. Yu HH, Liu PH, Lin YC, Chen WJ, Lee JH, Wang LC, et al. Interleukin 4 and STAT6 gene polymorphisms are associated with systemic lupus erythematosus in Chinese patients. Lupus 2010;19:1219-1228. https://doi.org/10.1177/0961203310371152