Profiling of copy number variations (CNVs) in healthy individuals from three ethnic groups using a human genome 32 K BAC-clone-based array

Hum Mutat. 2008 Mar;29(3):398-408. doi: 10.1002/humu.20659.

Abstract

To further explore the extent of large-scale structural variation in the human genome, we assessed copy number variations (CNVs) in 71 healthy subjects from three ethnic groups. CNVs were analyzed by comparative genomic hybridization (CGH) to a BAC array covering the human genome, using DNA extracted from peripheral blood, thus avoiding any culture-induced rearrangements. By applying a newly developed computational algorithm based on Hidden Markov modeling, we identified 1,078 autosomal CNVs, each involving at least two neighboring/overlapping BACs, which represent 315 distinct regions. The average size of these sequence polymorphisms was approximately 350 kb, and in total they involved approximately 117 Mb, or approximately 3.5% of the genome. Gains were about four times more common than deletions, and segmental duplications (SDs) were overrepresented, especially in larger deletion variants. This strengthens the notion that SDs often define hotspots of chromosomal rearrangements. Over 60% of the identified autosomal rearrangements match previously reported CNVs detected with various platforms. However, results from chromosome X do not agree well with previously annotated CNVs. Furthermore, data from single BACs deviating in copy number suggest that the above estimate of total variation is conservative. This report contributes to the establishment of a common baseline for CNV, which is an important resource in human genetics.
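The abstract does not describe the algorithm itself, so the following is only a minimal sketch of how Hidden Markov modeling can be used to call CNVs from array CGH data, not the authors' method. It assumes a generic three-state model (loss, neutral, gain) with Gaussian emissions on per-BAC log2 ratios, decodes the most likely state path with the Viterbi algorithm, and keeps only non-neutral runs spanning at least two consecutive clones, mirroring the two-BAC criterion mentioned above. All names and parameter values (MEANS, SIGMA, STAY, viterbi, call_cnvs) are hypothetical choices made for illustration.

```python
import math

STATES = ("loss", "neutral", "gain")
# Hypothetical emission means (log2 ratio) and shared standard deviation.
MEANS = {"loss": -0.5, "neutral": 0.0, "gain": 0.4}
SIGMA = 0.15
STAY = 0.98  # hypothetical probability of remaining in the same state


def log_emission(state, x):
    """Log-density of a Gaussian emission for one BAC log2 ratio."""
    z = (x - MEANS[state]) / SIGMA
    return -0.5 * z * z - math.log(SIGMA * math.sqrt(2 * math.pi))


def log_transition(prev, cur):
    """Log-probability of moving from state prev to state cur."""
    if prev == cur:
        return math.log(STAY)
    return math.log((1 - STAY) / (len(STATES) - 1))


def viterbi(ratios):
    """Most likely state path for log2 ratios ordered along one chromosome."""
    if not ratios:
        return []
    # Uniform initial distribution over the three states.
    score = {s: math.log(1 / len(STATES)) + log_emission(s, ratios[0])
             for s in STATES}
    back = []
    for x in ratios[1:]:
        new_score, pointers = {}, {}
        for cur in STATES:
            best_prev = max(STATES,
                            key=lambda p: score[p] + log_transition(p, cur))
            new_score[cur] = (score[best_prev]
                              + log_transition(best_prev, cur)
                              + log_emission(cur, x))
            pointers[cur] = best_prev
        score = new_score
        back.append(pointers)
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def call_cnvs(path, min_clones=2):
    """Return (first_index, last_index, state) for each non-neutral run
    that spans at least min_clones consecutive clones."""
    calls, start = [], 0
    for i in range(1, len(path) + 1):
        if i == len(path) or path[i] != path[start]:
            if path[start] != "neutral" and i - start >= min_clones:
                calls.append((start, i - 1, path[start]))
            start = i
    return calls


# Toy data: three consecutive elevated clones and one isolated low clone.
ratios = [0.0, 0.02, -0.01, 0.45, 0.38, 0.42, 0.01, -0.03, 0.0, -0.5, 0.02, 0.0]
calls = call_cnvs(viterbi(ratios))
```

With these assumed parameters, the three consecutive elevated clones are called as a single gain, while the isolated low clone stays neutral because one deviating data point does not outweigh the cost of switching state twice. This is consistent with the abstract's remark that single deviating BACs are not counted, which makes the overall estimate conservative.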

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Asian People / genetics
  • Black People / genetics
  • Chromosomes, Artificial, Bacterial / genetics
  • Chromosomes, Human, X / genetics
  • Female
  • Gene Dosage*
  • Gene Duplication
  • Gene Rearrangement
  • Genetic Variation*
  • Genome, Human
  • Humans
  • Male
  • Markov Chains
  • Oligonucleotide Array Sequence Analysis
  • Racial Groups / genetics*
  • White People / genetics