LSTrAP-Crowd: prediction of novel components of bacterial ribosomes with crowd-sourced analysis of RNA sequencing data

  • PDF / 3,288,810 Bytes
  • 13 Pages / 595.276 x 790.866 pts Page_size
  • 108 Downloads / 188 Views

DOWNLOAD

REPORT


RESEARCH ARTICLE

Open Access

LSTrAP-Crowd: prediction of novel components of bacterial ribosomes with crowd-sourced analysis of RNA sequencing data Benedict Hew, Qiao Wen Tan, William Goh, Jonathan Wei Xiong Ng and Marek Mutwil*

Abstract Background: Bacterial resistance to antibiotics is a growing health problem that is projected to cause more deaths than cancer by 2050. Consequently, novel antibiotics are urgently needed. Since more than half of the available antibiotics target the structurally conserved bacterial ribosomes, factors involved in protein synthesis are thus prime targets for the development of novel antibiotics. However, experimental identification of these potential antibiotic target proteins can be labor-intensive and challenging, as these proteins are likely to be poorly characterized and specific to few bacteria. Here, we use a bioinformatics approach to identify novel components of protein synthesis. Results: In order to identify these novel proteins, we established a Large-Scale Transcriptomic Analysis Pipeline in Crowd (LSTrAP-Crowd), where 285 individuals processed 26 terabytes of RNA-sequencing data of the 17 most notorious bacterial pathogens. In total, the crowd processed 26,269 RNA-seq experiments and used the data to construct gene co-expression networks, which were used to identify more than a hundred uncharacterized genes that were transcriptionally associated with protein synthesis. We provide the identity of these genes together with the processed gene expression data. Conclusions: We identified genes related to protein synthesis in common bacterial pathogens and thus provide a resource of potential antibiotic development targets for experimental validation. The data can be used to explore additional vulnerabilities of bacteria, while our approach demonstrates how the processing of gene expression data can be easily crowd-sourced. Keywords: Co-expression, Bacteria, Ribosome, Protein synthesis, RNA-seq, Crowdsourcing

Background Bacterial resistance to antibiotics is a serious and growing concern in public health, taking ca. 99,000 lives and costing 21–34 billion USD per year in the USA [1]. Methicillin-resistant Gram-positive Staphylococcus aureus (MRSA) and Gram-negative Pseudomonas aeruginosa are the leading causes of serious infections as they * Correspondence: [email protected] School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore

form biofilms. Biofilms are complex bacterial communities embedded in an extracellular matrix, and these communities are able to resist antimicrobial agents [2]. For instance, bacteria can be up to 1000× more tolerant to antibiotics when they grow as a biofilm, compared to single-cell suspension (planktonic cells). Consequently, new antibiotics are urgently needed to combat these resistance mechanisms, either alone or in combination with existing drugs.

© The Author(s). 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use,