Subscribe to RSS
DOI: 10.3414/ME9129
Prediction of Disease-associated Single Nucleotide Polymorphisms Using Virtual Genomes Constructed from a Public Haplotype Database
Publication History
Publication Date:
18 January 2018 (online)
Summary
Objectives: Simultaneous dealing of hundreds of thousands of single nucleotide polymorphisms (SNPs) in genome-wide association studies is laborious. The aim of our study is to develop a method to decrease the number of candidate SNPs prior to the genotyping of study subjects.
Methods: We created virtual genotype data on case and control subjects from data of the International HapMap Project by using haplotype-based simulation method. We repeated virtual case-control association studies and selected candidate SNPs. We applied the selected SNPs to previously published genetic casecontrol studies. Sensitivity to detect association with causative genes using our method was compared to the original studies and results using tag SNPs.
Results: We found a discrete distribution pattern of SNPs, which was able to produce significant results in case-control association studies. The number of candidate SNPs that we selected was 24.7% of the number of the original set of SNPs. We applied this method to previously published genetic case-control studies and found that the use of candidate SNPs improved the sensitivity for detecting significant alleles, both compared to the original studies and to the use of tag SNPs. The results were not affected by the difference of the diseases and genes involved.
Conclusions: Our simulation-based approach has advantages of reducing costs and improving performance to detect significant alleles. This method can be used without considering the specific disease and genes involved.
-
References
- 1 Zhang W, Shmulevich I. Computational and statistical approaches to genomics. Seattle: Springer; 2006
- 2 Wang WY, Barratt BJ, Clayton DG, Todd JA. Genome- wide association studies: theoretical and practical concerns. Nat Rev 2005; 6: 109-118.
- 3 Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev 2005; 6: 95-108.
- 4 Carlson CS, Eberle MA, Kruglyak L, Nickerson DA. Mapping complex disease loci in wholegenome association studies. Nature 2004; 429: 446-452.
- 5 Brors B. Microarray annotations and biological information on function. Methods Inf Med 2005; 44: 2005.
- 6 Hartmann O. Quality control for microarray experiments. Methods Inf Med 2005; 44: 557-563.
- 7 Wall JD, Pritchard JK. Haplotype blocks and linkage disequilibrium in the human genome. Nat Rev 2003; 4: 587-597.
- 8 de Bakker PI, Yelensky R, Pe’er I, Gabriel SB, Daly MJ, Altshuler D. Efficiency and power in genetic association studies. Nat Genet 2005; 37: 1217-1223.
- 9 Johnson GC, Esposito L, Barratt BJ, Smith AN, Heward J, Di Genova G, Ueda H, Cordell HJ, Eaves IA, Dudbridge F. et al. Haplotype tagging for the identification of common disease genes. Nat Genet 2001; 29: 233-237.
- 10 Zhang K, Akey JM, Wang N, Xiong M, Chakraborty R, Jin L. Randomly distributed crossovers may generate block-like patterns of linkage disequilibrium: an act of genetic drift. Hum Genet 2003; 113: 51-59.
- 11 Phillips MS, Lawrence R, Sachidanandam R, Morris AP, Balding DJ, Donaldson MA, Studebaker JF, Ankener WM, Alfisi SV, Kuo FS. et al. Chromosome-wide distribution of haplotype blocks and the role of recombination hot spots. Nat Genet 2003; 33: 382-387.
- 12 Martin ER, Lai EH, Gilbert JR, Rogala AR, Afshari AJ, Riley J, Finch KL, Stevens JF, Livak KJ, Slotterbeck BD. et al. SNPing away at complex diseases: analysis of single-nucleotide polymorphisms around APOE in Alzheimer disease. Am J Hum Genet 2000; 67: 383-394.
- 13 The International HapMap Project. Nature 2003; 426: 789-796.
- 14 The International HapMap Project.. A haplotype map of the human genome. Nature 2005; 437: 1299-1320.
- 15 Stephens M, Donnelly P. A comparison of Bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet 2003; 73: 1162-1169.
- 16 Stephens M, Smith NJ, Donnelly P. A new statistical method for haplotype reconstruction from population data. Am J Hum Genet 2001; 68: 978-989.
- 17 Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M. et al. The structure of haplotype blocks in the human genome. Science 2002; 296: 2225-2229.
- 18 Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 2005; 21: 263-265.
- 19 Kuwano R, Miyashita A, Arai H, Asada T, Imagawa M, Shoji M, Higuchi S, Urakami K, Kakita A, Takahashi H. et al. Dynamin-binding protein gene on chromosome 10q is associated with late-onset Alzheimer’s disease. Hum Mol Genet 2006; 15: 2170-2182.
- 20 Li M, Atmaca-Sonmez P, Othman M, Branham KE, Khanna R, Wade MS, Li Y, Liang L, Zareparsi S, Swaroop A. et al. CFH haplotypes without the Y402H coding variant show strong association with susceptibility to age-related macular degeneration. Nat Genet 2006; 38: 1049-1054.
- 21 Smyth DJ, Cooper JD, Bailey R, Field S, Burren O, Smink LJ, Guja C, Ionescu-Tirgoviste C, Widmer B, Dunger DB. et al. A genome-wide association study of nonsynonymous SNPs identifies a type 1 diabetes locus in the interferoninduced helicase (IFIH1) region. Nat Genet 2006; 38: 617-619.
- 22 Mizuta I, Satake W, Nakabayashi Y, Ito C, Suzuki S, Momose Y, Nagai Y, Oka A, Inoko H, Fukae J. et al. Multiple candidate gene analysis identifies alpha-synuclein as a susceptibility gene for sporadic Parkinson’s disease. Hum Mol Genet 2006; 15: 1151-1158.
- 23 Valentonyte R, Hampe J, Huse K, Rosenstiel P, Albrecht M, Stenzel A, Nagy M, Gaede KI, Franke A, Haesler R. et al. Sarcoidosis is associated with a truncating splice site mutation in BTNL2. Nat Genet 2005; 37: 357-364.
- 24 Erlich PM, Lunetta KL, Cupples LA, Huyck M, Green RC, Baldwin CT, Farrer LA. et al. Polymorphisms in the PON gene cluster are associated with Alzheimer disease. Hum Mol Genet 2006; 15: 77-85.
- 25 Scott LJ, Bonnycastle LL, Willer CJ, Sprau AG, Jackson AU, Narisu N, Duren WL, Chines PS, Stringham HM, Erdos MR. et al. Association of transcription factor 7-like 2 (TCF7L2) variants with type 2 diabetes in a Finnish sample. Diabetes 2006; 55: 2649-2653.
- 26 Lucae S, Salyakina D, Barden N, Harvey M, Gagne B, Labbe M, Binder EB, Uhr M, Paez-Pereda M, Sillaber I. et al. P2RX7, a gene coding for a purinergic ligand-gated ion channel, is associated with major depressive disorder. Hum Mol Genet 2006; 15: 2438-2445.
- 27 Ozaki K, Tanaka T. Genome-wide association study to identify single-nucleotide polymorphisms conferring risk of myocardial infarction. Methods Mol Med 2006; 128: 173-180.
- 28 Monsuur AJ, de Bakker PI, Alizadeh BZ, Zhernakova A, Bevova MR, Strengman E, Franke L, van’t Slot R, van Belzen MJ, Lavrijsen IC. et al. Myosin IXB variant increases the risk of celiac disease and points toward a primary intestinal barrier defect. Nat Genet 2005; 37: 1341-1344.
- 29 Mori M, Yamada R, Kobayashi K, Kawaida R, Yamamoto K. Ethnic differences in allele frequency of autoimmune-disease-associated SNPs. J Hum Genet 2005; 50: 264-266.
- 30 Park J, Hwang S, Lee YS, Kim SC, Lee D. SNP@Ethnos: a database of ethnically variant single-nucleotide polymorphisms. Nucleic Acids Res 2007; 35: D711-D715.
- 31 Bertram L, Hiltunen M, Parkinson M, Ingelsson M, Lange C, Ramasamy K, Mullin K, Menon R, Sampson AJ, Hsiao MY. et al. Family-based association between Alzheimer’s disease and variants in UBQLN1. New Engl J Med 2005; 352: 884-894.
- 32 Smemo S, Nowotny P, Hinrichs AL, Kauwe JS, Cherny S, Erickson K, Myers AJ, Kaleem M, Marlowe L, Gibson AM. et al. Ubiquilin 1 polymorphisms are not associated with late-onset Alzheimer’s disease. Ann Neurol 2006; 59: 21-26.
- 33 Slifer MA, Martin ER, Bronson PG, Browning-Large C, Doraiswamy PM, Welsh-Bohmer KA, Gilbert JR, Haines JL, Pericak-Vance MA. et al. Lack of association between UBQLN1 and Alzheimer disease. Am J Med Genet B Neuropsychiatr Genet 2006; 141: 208-213.
- 34 Kamboh MI, Minster RL, Feingold E, DeKosky ST. Genetic association of ubiquilin with Alzheimer’s disease and related quantitative measures. Mol Psychiatry 2006; 11: 273-279.