MATHEMATICAL MODELS AND METHODS

Extraction of Mitochondrial Genome from Whole Genome Next Generation Sequencing Data and Unveiling of Forensically Relevant Markers

S. Rauf (a, *), N. Zahra (a), S. S. Malik (a), S. A. e Zahra (a), K. Sughra (b), and M. R. Khan (a, c, **)

a Genome Editing and Sequencing Lab, National Centre for Bioinformatics, Quaid-i-Azam University, Islamabad, Pakistan
b Department of Biochemistry and Molecular Biology, University of Gujrat, Gujrat, Pakistan
c National Institute for Genomics and Advanced Biotechnology, National Agricultural Research Centre, Islamabad, Pakistan
*e-mail: [email protected]
**e-mail: [email protected]

Received July 21, 2019; revised October 2, 2019; accepted March 12, 2020

Abstract: Forensic science has benefitted greatly from STR and SNP markers at the genetic level, but their analysis at the genomic level, especially the isolation of the mitochondrial genome from whole genome sequence and the identification of forensic markers, has remained obscure. In the present study, whole genome next generation sequencing was performed on the Illumina HiSeq-4000 platform, after which mtDNA was separated and SNPs and STRs were extracted using computational pipelines. The mitochondrial genome sequence was successfully extracted from the whole genome sequencing data of a Pothwari ethnic individual of Pakistan. Two heteroplasmic sites were identified in the genes MT-RNR1 and ND6, which are located in the control and coding regions, respectively. Comparison of the target whole genome sequence with the reference genome database revealed 2,436,328 SNPs, 119,399 insertions and 119,290 deletions, for which variable ratios of transitions/transversions and heterozygosity/homozygosity were observed. A total of 63 forensically relevant SNPs were identified, including 21 ancestry-informative, 40 identity-informative and 2 phenotype-informative SNPs. Besides SNPs, two types of STR markers were obtained: 8 CODIS STRs and 39 Y-STRs. CODIS STRs exhibited more variation than Y-STRs because Y-STRs do not undergo crossing over and hence mostly remain conserved across generations, with the exception of rapidly mutating Y-STRs, which are crucial in differentiating closely and distantly related males. This study reports whole genome sequencing and the development of pipelines for extraction of mitochondrial DNA, and offers insight into the detection of forensic markers, i.e., SNPs and STRs, which can serve as an initiative to develop local forensic databases and as a record resource for ethnicity discrimination and detection in crimes and disasters.

Keywords: mitochondrial DNA, whole genome next generation sequencing, forensics, SNPs, STRs

DOI: 10.1134/S1022795420080128
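The abstract reports transition/transversion and heterozygosity/homozygosity ratios without stating how they were computed. The following is a minimal Python sketch of one common way to derive both ratios from a list of single-base substitutions. It is not the authors' pipeline: the function names, the record layout (reference base, alternate base, VCF-style genotype string) and the toy data are all assumptions made for illustration.

```python
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def classify_snp(ref, alt):
    """Classify a single-base substitution as a transition or a transversion.

    A transition stays within one chemical class (purine<->purine or
    pyrimidine<->pyrimidine); a transversion crosses between classes.
    """
    ref, alt = ref.upper(), alt.upper()
    if ref not in BASES or alt not in BASES or ref == alt:
        raise ValueError(f"not a single-base substitution: {ref}>{alt}")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def summarize(variants):
    """Summarize SNPs given as (ref, alt, genotype) tuples.

    The genotype is a VCF-style string such as '0/1' or '1|1' (hypothetical
    input layout). Returns counts plus the two ratios; a ratio is None when
    its denominator is zero.
    """
    counts = Counter()
    for ref, alt, genotype in variants:
        counts[classify_snp(ref, alt)] += 1
        alleles = genotype.replace("|", "/").split("/")
        counts["het" if len(set(alleles)) > 1 else "hom"] += 1

    def ratio(numerator, denominator):
        return numerator / denominator if denominator else None

    return {
        "transitions": counts["transition"],
        "transversions": counts["transversion"],
        "het": counts["het"],
        "hom": counts["hom"],
        "ts_tv": ratio(counts["transition"], counts["transversion"]),
        "het_hom": ratio(counts["het"], counts["hom"]),
    }


# Toy records for illustration only; these are NOT data from the study.
TOY_VARIANTS = [
    ("A", "G", "0/1"),  # transition, heterozygous
    ("C", "T", "1/1"),  # transition, homozygous
    ("A", "C", "0/1"),  # transversion, heterozygous
    ("G", "T", "1/1"),  # transversion, homozygous
    ("T", "C", "0/1"),  # transition, heterozygous
    ("G", "A", "0|1"),  # transition, heterozygous (phased)
]

if __name__ == "__main__":
    print(summarize(TOY_VARIANTS))
```

In practice the tuples would be read from the variant call file produced by the alignment and calling steps; insertions and deletions would have to be filtered out first, since the transition/transversion classification applies only to single-base substitutions.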

1. INTRODUCTION

Forensic science is strikingly intricate and incorporates techniques ranging from DNA examination to pattern identification [1]. For the last two decades, length-based STR markers have served as the gold standard for suspect identification in forensic cases. In forensic DNA analysis, Y-STRs, owing to their efficiency in capturing the polymorphism present on the Y chromosome, show exceptional power to discriminate male sample wit