A Phylogenetic Rate Parameter Indicates Different Sequence Divergence Patterns in Orthologs and Paralogs
- PDF / 1,636,523 Bytes
- 11 Pages / 595.276 x 790.866 pts Page_size
- 61 Downloads / 202 Views
ORIGINAL ARTICLE
A Phylogenetic Rate Parameter Indicates Different Sequence Divergence Patterns in Orthologs and Paralogs Joseph B. Ahrens1,3 · Ashley I. Teufel2,4 · Jessica Siltberg‑Liberles1 Received: 12 March 2020 / Accepted: 15 October 2020 © Springer Science+Business Media, LLC, part of Springer Nature 2020
Abstract Heterotachy—the change in sequence evolutionary rate over time—is a common feature of protein molecular evolution. Decades of studies have shed light on the conditions under which heterotachy occurs, and there is evidence that site-specific evolutionary rate shifts are correlated with changes in protein function. Here, we present a large-scale, computational analysis using thousands of protein sequence alignments from animal and plant proteomes, representing genes related either by orthology (speciation events) or paralogy (gene duplication), to compare sequence divergence patterns in orthologous vs. paralogous sequence alignments. We use sequence-based phylogenetic analyses to infer overall sequence divergence (tree length/number of sequences) and to fit site-specific rates to a discrete gamma distribution with a shape parameter α. This inference method is applied to real protein sequence alignments, as well as alignments simulated under various models of protein sequence evolution. Our simulations indicate that sequence divergence and the α parameter are positively correlated when sequences evolve with heterotachy, meaning that inferred site rate distributions appear more uniform as sequences diverge. Divergence and α are also positively correlated in both orthologous and paralogous genes, but the average increase in α (as a function of divergence) is significantly higher in paralogous protein alignments than in orthologous alignments. This result is consistent with the widely held view that recently duplicated proteins initially evolve under relaxed selective pressure, promoting functional divergence by accumulation of amino acid replacements, and hence experience more evolutionary rate fluctuations than orthologous proteins. We discuss these findings in the context of the ortholog conjecture, a long-standing assumption in molecular evolution, which posits that protein sequences related by orthology tend to be more functionally conserved than paralogous proteins. Keywords Heterotachy · Ortholog · Paralog · Protein · Rate heterogeneity
Handling Editor: Jason de Koning. Electronic supplementary material The online version of this article (https://doi.org/10.1007/s00239-020-09969-7) contains supplementary material, which is available to authorized users. * Joseph B. Ahrens [email protected] * Jessica Siltberg‑Liberles [email protected] 1
Department of Biological Sciences, Biomolecular Sciences Institute, Florida International University, Miami, FL, USA
2
Department of Integrative Biology, The University of Texas At Austin, Austin, TX, USA
3
Present Address: Department of Biochemistry and Molecular Genetics, Computational Bioscience Program, University of Colorado Denver, Auror
Data Loading...