DOI QR코드

DOI QR Code

BioSMACK: a linux live CD for genome-wide association analyses

  • Hong, Chang-Bum (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health) ;
  • Kim, Young-Jin (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health) ;
  • Moon, Sang-Hoon (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health) ;
  • Shin, Young-Ah (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health) ;
  • Go, Min-Jin (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health) ;
  • Kim, Dong-Joon (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health) ;
  • Lee, Jong-Young (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health) ;
  • Cho, Yoon-Shin (Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health)
  • Received : 2011.08.05
  • Accepted : 2011.09.29
  • Published : 2012.01.31

Abstract

Recent advances in high-throughput genotyping technologies have enabled us to conduct a genome-wide association study (GWAS) on a large cohort. However, analyzing millions of single nucleotide polymorphisms (SNPs) is still a difficult task for researchers conducting a GWAS. Several difficulties such as compatibilities and dependencies are often encountered by researchers using analytical tools, during the installation of software. This is a huge obstacle to any research institute without computing facilities and specialists. Therefore, a proper research environment is an urgent need for researchers working on GWAS. We developed BioSMACK to provide a research environment for GWAS that requires no configuration and is easy to use. BioSMACK is based on the Ubuntu Live CD that offers a complete Linux-based operating system environment without installation. Moreover, we provide users with a GWAS manual consisting of a series of guidelines for GWAS and useful examples. BioSMACK is freely available at http://ksnp.cdc.go.kr/biosmack.

Keywords

References

  1. Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A., Bender, D., Maller, J., Sklar, P., de Bakker, P. I., Daly, M. J. and Sham, P. C. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559-575. https://doi.org/10.1086/519795
  2. Price, A. L., Patterson, N. J., Plenge, R. M., Weinblatt, M. E., Shadick, N. A. and Reich, D. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet 38, 904-909. https://doi.org/10.1038/ng1847
  3. Pritchard, J. K., Stephens, M. and Donnelly, P. (2000) Inference of population structure using multilocus genotype data. Genetics. 155, 945-959.
  4. Clayton, D. and Leung, H. T. (2007) An R package for analysis of whole-genome association studies. Hum. Hered. 64, 45-51. https://doi.org/10.1159/000101422
  5. Vetrivel, U. and Pilla, K. (2008) Open discovery: An integrated live Linux platform of Bioinformatics tools. Bioinformation 3, 144-146. https://doi.org/10.6026/97320630003144
  6. Estrada, K., Abuseiris, A., Grosveld, F. G., Uitterlinden, A. G., Knoch, T. A. and Rivadeneira, F. (2009) GRIMP: a web- and grid-based tool for high-speed analysis of large-scale genome-wide association using imputed data. Bioinformatics 25, 2750-2752. https://doi.org/10.1093/bioinformatics/btp497
  7. Geeleher, P., Morris, D., Hinde, J. P. and Golden, A. (2009) BioconductorBuntu: a Linux distribution that implements a web-based DNA microarray analysis server. Bioinformatics 25, 1438-1439. https://doi.org/10.1093/bioinformatics/btp165
  8. Thomson, R. C. (2009) PhyLIS: a simple GNU/Linux distribution for phylogenetics and phyloinformatics. Evol. Bioinform. Online 5, 91-95.
  9. Willer, C. J., Li, Y. and Abecasis, G. R. (2010) METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190-2191. https://doi.org/10.1093/bioinformatics/btq340
  10. Marchini, J., Howie, B., Myers, S., McVean, G. and Donnelly, P. (2007) A new multipoint method for genome- wide association studies by imputation of genotypes. Nat. Genet. 39, 906-913. https://doi.org/10.1038/ng2088
  11. Li, Y., Willer, C. J., Ding, J., Scheet, P. and Abecasis, G. R. (2010) MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet. Epidemiol 34, 816-834. https://doi.org/10.1002/gepi.20533