CIPR: a web-based R/shiny app and R package to annotate cell clusters in single cell RNA sequencing experiments
- PDF / 3,861,289 Bytes
- 15 Pages / 595.276 x 793.701 pts Page_size
- 93 Downloads / 195 Views
SOFTWARE
Open Access
CIPR: a web-based R/shiny app and R package to annotate cell clusters in single cell RNA sequencing experiments H. Atakan Ekiz1,2*, Christopher J. Conley3, W. Zac Stephens1,2 and Ryan M. O’Connell1,2* * Correspondence: atakan.ekiz@ path.utah.edu; ryan.oconnell@path. utah.edu 1 Division of Microbiology and Immunology, Department of Pathology, University of Utah, 15 N. Medical Dr. East, JMRB, Salt Lake City, UT 84112, USA Full list of author information is available at the end of the article
Abstract Background: Single cell RNA sequencing (scRNAseq) has provided invaluable insights into cellular heterogeneity and functional states in health and disease. During the analysis of scRNAseq data, annotating the biological identity of cell clusters is an important step before downstream analyses and it remains technically challenging. The current solutions for annotating single cell clusters generally lack a graphical user interface, can be computationally intensive or have a limited scope. On the other hand, manually annotating single cell clusters by examining the expression of marker genes can be subjective and labor-intensive. To improve the quality and efficiency of annotating cell clusters in scRNAseq data, we present a web-based R/Shiny app and R package, Cluster Identity PRedictor (CIPR), which provides a graphical user interface to quickly score gene expression profiles of unknown cell clusters against mouse or human references, or a custom dataset provided by the user. CIPR can be easily integrated into the current pipelines to facilitate scRNAseq data analysis. Results: CIPR employs multiple approaches for calculating the identity score at the cluster level and can accept inputs generated by popular scRNAseq analysis software. CIPR provides 2 mouse and 5 human reference datasets, and its pipeline allows interspecies comparisons and the ability to upload a custom reference dataset for specialized studies. The option to filter out lowly variable genes and to exclude irrelevant reference cell subsets from the analysis can improve the discriminatory power of CIPR suggesting that it can be tailored to different experimental contexts. Benchmarking CIPR against existing functionally similar software revealed that our algorithm is less computationally demanding, it performs significantly faster and provides accurate predictions for multiple cell clusters in a scRNAseq experiment involving tumor-infiltrating immune cells. (Continued on next page)
© The Author(s). 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If ma
Data Loading...