Shrinking of repeating unit length in leucine-rich repeats from double-stranded DNA viruses
- PDF / 3,333,791 Bytes
- 22 Pages / 595.276 x 790.866 pts Page_size
- 82 Downloads / 141 Views
ORIGINAL ARTICLE
Shrinking of repeating unit length in leucine‑rich repeats from double‑stranded DNA viruses Norio Matsushima1,2 · Hiroki Miyashita1,3 · Shinsuke Tamaki1 · Robert H. Kretsinger4 Received: 20 May 2020 / Accepted: 19 August 2020 © Springer-Verlag GmbH Austria, part of Springer Nature 2020
Abstract Leucine-rich repeats (LRRs) are present in over 563,000 proteins from viruses to eukaryotes. LRRs repeat in tandem and have been classified into fifteen classes in which the repeat unit lengths range from 20 to 29 residues. Most LRR proteins are involved in protein-protein or ligand interactions. The amount of genome sequence data from viruses is increasing rapidly, and although viral LRR proteins have been identified, a comprehensive sequence analysis has not yet been done, and their structures, functions, and evolution are still unknown. In the present study, we characterized viral LRRs by sequence analysis and identified over 600 LRR proteins from 89 virus species. Most of these proteins were from double-stranded DNA (dsDNA) viruses, including nucleocytoplasmic large dsDNA viruses (NCLDVs). We found that the repeating unit lengths of 11 types are one to five residues shorter than those of the seven known corresponding LRR classes. The repeating units of six types are 19 residues long and are thus the shortest among all LRRs. In addition, two of the LRR types are unique and have not been observed in bacteria, archae or eukaryotes. Conserved strongly hydrophobic residues such as Leu, Val or Ile in the consensus sequences are replaced by Cys with high frequency. Phylogenetic analysis indicated that horizontal gene transfer of some viral LRR genes had occurred between the virus and its host. We suggest that the shortening might contribute to the survival strategy of viruses. The present findings provide a new perspective on the origin and evolution of LRRs.
Introduction Leucine-rich repeats (LRRs) are present in over 563,000 proteins from viruses to eukaryotes with sequences in the databases. LRRs repeat in tandem. The repeating units are 20–29 residues long. Short β-strands in the repeating units stack in parallel and thus form a superhelix arrangement called a solenoid structure [1]. Most LRR proteins are involved Handling Editor: Tim Skern. Electronic supplementary material The online version of this article (https://doi.org/10.1007/s00705-020-04820-2) contains supplementary material, which is available to authorized users. * Norio Matsushima [email protected] 1
Division of Bioinformatics, Institute of Tandem Repeats, Noboribetsu 059‑0464, Japan
2
Center for Medical Education, Sapporo Medical University, Sapporo 060‑8556, Japan
3
Hokubu Rinsho Co., Ltd, Sapporo 060‑0061, Japan
4
Department of Biology, University of Virginia, Charlottesville, VA 22904, USA
in protein-protein or ligand interactions, as observed in the plant immune response and the vertebrate innate immune response [2, 3]. Genome sequence data of nucleocytoplasmic large dsDNA viruses (NCLDVs) including members
Data Loading...