Nucleic Acids Research Advance Access first published online on March 6, 2007
This version published online on March 16, 2007
Nucleic Acids Research, doi:10.1093/nar/gkm076
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Computational Biology |
QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data
1Genomics Laboratory and 4Bioinformatics, Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, 2Life Science Interface Doctoral Training Centre, Wolfson Building, Parks Road, Oxford OX1 3QD, 3Henry Wellcome Centre for Gene Function, Department of Statistics, University of Oxford, Oxford, OX1 3TG, 5Oxford Medical Genetics Laboratories, The Churchill Hospital, Oxford, OX3 7LJ, UK, 6Centre for Addiction & Mental Health, University of Toronto, 1001 Queen Street West, Toronto, Ontario M6J 1H4, Canada and 7MRC Mammalian Genetics Unit, Medical Research Council, Harwell, Oxford, OX11 0RD
*To whom correspondence should be addressed. Tel: +44-(0)1865 287526; Fax: +44-(0)1865 287533; Email: ioannis.ragoussis{at}well.ox.ac.uk
*Correspondence may also be addressed to Christopher C. Holmes. Tel: +44 (0)1865 285368; Fax: +44 (0)1865 285384; Email: cholmes{at}stats.ox.ac.uk
Received December 20, 2006. Revised January 24, 2007. Accepted January 25, 2007.
Array-based technologies have been used to detect chromosomal copy number changes (aneuploidies) in the human genome. Recent studies identified numerous copy number variants (CNV) and some are common polymorphisms that may contribute to disease susceptibility. We developed, and experimentally validated, a novel computational framework (QuantiSNP) for detecting regions of copy number variation from BeadArrayTM SNP genotyping data using an Objective Bayes Hidden-Markov Model (OB-HMM). Objective Bayes measures are used to set certain hyperparameters in the priors using a novel re-sampling framework to calibrate the model to a fixed Type I (false positive) error rate. Other parameters are set via maximum marginal likelihood to prior training data of known structure. QuantiSNP provides probabilistic quantification of state classifications and significantly improves the accuracy of segmental aneuploidy identification and mapping, relative to existing analytical tools (Beadstudio, Illumina), as demonstrated by validation of breakpoint boundaries. QuantiSNP identified both novel and validated CNVs. QuantiSNP was developed using BeadArrayTM SNP data but it can be adapted to other platforms and we believe that the OB-HMM framework has widespread applicability in genomic research. In conclusion, QuantiSNP is a novel algorithm for high-resolution CNV/aneuploidy detection with application to clinical genetics, cancer and disease association studies.
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. D. Greenman, G. Bignell, A. Butler, S. Edkins, J. Hinton, D. Beare, S. Swamy, T. Santarius, L. Chen, S. Widaa, et al. PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data Biostat., October 15, 2009; (2009) kxp045v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Lynch, M. Dunning, M. Iddawela, N. Barbosa-Morais, and M. Ritchie Considerations for the processing and analysis of GoldenGate-based two-colour Illumina platforms Statistical Methods in Medical Research, October 1, 2009; 18(5): 437 - 452. [Abstract] [PDF] |
||||
![]() |
S. L. Spain, J.-B. Cazier, The CORGI Consortium,, R. Houlston, L. Carvajal-Carmona, and I. Tomlinson Colorectal Cancer Risk Is Not Associated with Increased Levels of Homozygosity in a Population from the United Kingdom Cancer Res., September 15, 2009; 69(18): 7422 - 7429. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Winchester, C. Yau, and J. Ragoussis Comparing CNV detection methods for SNP arrays Brief Funct Genomic Proteomic, September 8, 2009; (2009) elp017v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Sun, F. A. Wright, Z. Tang, S. H. Nordgard, P. V. Loo, T. Yu, V. N. Kristensen, and C. M. Perou Integrated study of copy number states and genotype calls using high-density SNP arrays Nucleic Acids Res., September 1, 2009; 37(16): 5365 - 5377. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yoon, Z. Xuan, V. Makarov, K. Ye, and J. Sebat Sensitive and accurate detection of copy number variants using read depth of coverage Genome Res., September 1, 2009; 19(9): 1586 - 1592. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. LaFramboise Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances Nucleic Acids Res., July 1, 2009; (2009) gkp552v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Y. Wu, H. A. Chipman, S. B. Bull, L. Briollais, and K. Wang A Bayesian segmentation approach to ascertain copy number variations at the population level Bioinformatics, July 1, 2009; 25(13): 1669 - 1679. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Shah, K-J. Cheung Jr, N. A. Johnson, G. Alain, R. D. Gascoyne, D. E. Horsman, R. T. Ng, and K. P. Murphy Model-based clustering of array CGH data Bioinformatics, June 15, 2009; 25(12): i30 - i38. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. LaFramboise, W. Winckler, and R. K. Thomas A flexible rank-based framework for detecting copy number aberrations from array data Bioinformatics, March 15, 2009; 25(6): 722 - 728. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Rujescu, A. Ingason, S. Cichon, O. P.H. Pietilainen, M. R. Barnes, T. Toulopoulou, M. Picchioni, E. Vassos, U. Ettinger, E. Bramon, et al. Disruption of the neurexin 1 gene is associated with schizophrenia Hum. Mol. Genet., March 1, 2009; 18(5): 988 - 996. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Cronin, H. M. Blauw, J. H. Veldink, M. A. van Es, R. A. Ophoff, D. G. Bradley, L. H. van den Berg, and O. Hardiman Analysis of genome-wide copy number variation in Irish and Dutch ALS populations Hum. Mol. Genet., November 1, 2008; 17(21): 3392 - 3398. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Wang and M. Bucan Copy Number Variation Detection via High-Density SNP Genotyping CSH Protocols, June 1, 2008; 2008(7): pdb.top46 - pdb.top46. [Abstract] [Full Text] |
||||
![]() |
R. Andersson, C. E. G. Bruder, A. Piotrowski, U. Menzel, H. Nord, J. Sandgren, T. R. Hvidsten, T. Diaz de Stahl, J. P. Dumanski, and J. Komorowski A segmental maximum a posteriori approach to genome-wide copy number profiling Bioinformatics, March 15, 2008; 24(6): 751 - 758. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Wang, M. Li, D. Hadley, R. Liu, J. Glessner, S. F.A. Grant, H. Hakonarson, and M. Bucan PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data Genome Res., November 1, 2007; 17(11): 1665 - 1674. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. O. Korbel, A. E. Urban, F. Grubert, J. Du, T. E. Royce, P. Starr, G. Zhong, B. S. Emanuel, S. M. Weissman, M. Snyder, et al. Systematic prediction and validation of breakpoints associated with copy-number variants in the human genome PNAS, June 12, 2007; 104(24): 10110 - 10115. [Abstract] [Full Text] [PDF] |
||||









