A survey of analysis software for array-comparative genomic hybridisation studies to detect copy number variation

  • PDF / 101,685 Bytes
  • 7 Pages / 609.449 x 790.866 pts Page_size
  • 12 Downloads / 163 Views

DOWNLOAD

REPORT


A survey of analysis software for array-comparative genomic hybridisation studies to detect copy number variation Anis Karimpour-Fard,1* Laura Dumas,2 Tzulip Phang,3 James M. Sikela2 and Lawrence E. Hunter1 1

Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, CO 80045, USA Department of Pharmacology, Neuroscience and Human Medical Genetics Programs, University of Colorado at Denver and Health Sciences Center, Aurora, CO 80045, USA 3 Department of Medicine, University of Colorado Denver School of Medicine, Denver, CO 80045, USA *Correspondence to: Tel.: þ1 (303)724 0274; Fax: þ1 (303)724 3663; E-mail: [email protected] 2

Date received (in revised form): 27th August 2010

Abstract Copy number variants (CNVs) create a major source of variation among individuals and populations. Array-based comparative genomic hybridisation (aCGH) is a powerful method used to detect and compare the copy numbers of DNA sequences at high resolution along the genome. In recent years, several informatics tools for accurate and efficient CNV detection and assessment have been developed. In this paper, most of the well known algorithms, analysis software and the limitations of that software will be briefly reviewed. Keywords: copy number variants, CNV, deletion, insertion, duplication, aCGH

Background Copy number variants (CNVs) are DNA sequences that are present in different amounts among individuals in a population. Copy number differences can confer a change in gene expression, phenotypic variation, disease susceptibility,1 – 5 and gene and genome evolution.6,7 Repetitive sequences that flank a specific genomic region can further facilitate a duplication or deletion of that region via the mechanism of non-allelic homologous recombination, which can occur when paralogous sequences in the genome mis-pair during meiois.8 – 10 A key method used to study CNVs across individuals is that of array-based comparative genomic hybridisation (aCGH). The goal of aCGH experiments is to detect and compare the copy numbers of DNA sequences at high resolution along the genome. Several informatics tools currently exist for accurate and efficient CNV detection and assessment.

These tools assist in automated analysis of array CGH data and user-friendly copy number reporting for individual samples. The goal of the statistical algorithms used in these software programs is to call aberrations reliably, accurately and precisely. The analysis of CNVs is broken down into several steps, including: (i) pre-processing and normalisation of the raw data; (ii) aligning data with its genome location, conducting segmentation analysis and providing statistical analysis to ensure the reliability of detection; and (iii) post-processing to assign biological meaning to the different states. (i) Normalisation of the log2 ratios is typically conducted in an attempt to adjust for sources of systematic variation. Since these effects are often not known or measured, most aCGH methodologies incorporate global normalisation tec