SNPTrack TM : an integrated bioinformatics system for genetic association studies

  • PDF / 239,154 Bytes
  • 3 Pages / 595.28 x 793.7 pts Page_size
  • 98 Downloads / 173 Views

DOWNLOAD

REPORT


GENOME DATABASE

Open Access

SNPTrackTM : an integrated bioinformatics system for genetic association studies Joshua Xu1, Reagan Kelly1, Guangxu Zhou1, Steven A Turner1, Don Ding1, Stephen C Harris2, Huixiao Hong2, Hong Fang1* and Weida Tong2*

Abstract A genetic association study is a complicated process that involves collecting phenotypic data, generating genotypic data, analyzing associations between genotypic and phenotypic data, and interpreting genetic biomarkers identified. SNPTrack is an integrated bioinformatics system developed by the US Food and Drug Administration (FDA) to support the review and analysis of pharmacogenetics data resulting from FDA research or submitted by sponsors. The system integrates data management, analysis, and interpretation in a single platform for genetic association studies. Specifically, it stores genotyping data and single-nucleotide polymorphism (SNP) annotations along with study design data in an Oracle database. It also integrates popular genetic analysis tools, such as PLINK and Haploview. SNPTrack provides genetic analysis capabilities and captures analysis results in its database as SNP lists that can be cross-linked for biological interpretation to gene/protein annotations, Gene Ontology, and pathway analysis data. With SNPTrack, users can do the entire stream of bioinformatics jobs for genetic association studies. SNPTrack is freely available to the public at http://www.fda.gov/ScienceResearch/BioinformaticsTools/SNPTrack/ default.htm.

Introduction Personalized medicine will improve health outcomes and patient satisfaction. However, implementing personalized medicine based on individuals' biological information relies on genetic biomarkers that are identified through genetic association studies. High-throughput genotyping technologies have been advanced to enable the simultaneous determination of genotypes for millions of single-nucleotide polymorphisms (SNPs). Concurrently, the International HapMap Project determined genotypes of over 3.1 million common SNPs in human populations [1]. These advances combine to make genetic association studies a feasible and promising research field for personalized medicine. However, there are a number of bioinformatics challenges associated with the enormous amount of genetic data generated by high-throughput technologies. Storing and accessing the data, performing association tests, and interpreting results can no longer be readily done using ad hoc approaches * Correspondence: [email protected]; [email protected] 1 ICF International at NCTR, 3900 NCTR Rd, Jefferson, AR 72079, USA 2 Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR Rd, Jefferson AR 72079, USA Full list of author information is available at the end of the article

commonly utilized for much smaller candidate gene association studies. Furthermore, because contributions of individual polymorphisms to a phenotype are typically quite small, appropriate analysis and interpretati