Alignment and mapping methodology influence transcript abundance estimation

PDF / 6,510,843 Bytes
29 Pages / 595 x 794 pts Page_size
43 Downloads / 213 Views

RESEARCH

Open Access

Alignment and mapping methodology influence transcript abundance estimation Avi Srivastava1† , Laraib Malik1† , Hirak Sarkar2 , Mohsen Zakeri2 , Fatemeh Almodaresi2 , Charlotte Soneson3,4 , Michael I. Love5,6 , Carl Kingsford7 and Rob Patro2* *Correspondence: [email protected] † Avi Srivastava and Laraib Malik contributed equally to this work. 2 Department of Computer Science, University of Maryland, College Park, USA Full list of author information is available at the end of the article

Abstract Background: The accuracy of transcript quantification using RNA-seq data depends on many factors, such as the choice of alignment or mapping method and the quantification model being adopted. While the choice of quantification model has been shown to be important, considerably less attention has been given to comparing the effect of various read alignment approaches on quantification accuracy. Results: We investigate the influence of mapping and alignment on the accuracy of transcript quantification in both simulated and experimental data, as well as the effect on subsequent differential expression analysis. We observe that, even when the quantification model itself is held fixed, the effect of choosing a different alignment methodology, or aligning reads using different parameters, on quantification estimates can sometimes be large and can affect downstream differential expression analyses as well. These effects can go unnoticed when assessment is focused too heavily on simulated data, where the alignment task is often simpler than in experimentally acquired samples. We also introduce a new alignment methodology, called selective alignment, to overcome the shortcomings of lightweight approaches without incurring the computational cost of traditional alignment. Conclusion: We observe that, on experimental datasets, the performance of lightweight mapping and alignment-based approaches varies significantly, and highlight some of the underlying factors. We show this variation both in terms of quantification and downstream differential expression analysis. In all comparisons, we also show the improved performance of our proposed selective alignment method and suggest best practices for performing RNA-seq quantification. Keywords: RNA-seq, Read-alignment, Quantification

© The Author(s). 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permiss

Data Loading...

Alignment and mapping methodology influence transcript abundance estimation

Recommend Documents

Transcript

Transcript

Maternal Transcript

Efficient influence spread estimation for influence maximization

Quantitative Estimation of Nitrogen Fixation: 15N2 Natural Abundance Study

Transcript 1

Transcript Elongation

Ageing and ovarian stimulation modulate the relative levels of transcript abundance of oocyte DNA repair genes during th

RGB-D camera calibration and trajectory estimation for indoor mapping

Unsupervised Cross-Modal Alignment for Multi-person 3D Pose Estimation

Risk Mapping Methodology for Agricultural Cooperative Sector Enterprises

Frequency of intron loss correlates with processed pseudogene abundance: a novel strategy to test the reverse transcript