Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments
- PDF / 734,830 Bytes
- 9 Pages / 595.276 x 790.866 pts Page_size
- 33 Downloads / 165 Views
RESEARCH
Open Access
Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments Nicolas Borisov1,2* , Maxim Sorokin1,3, Victor Tkachev1, Andrew Garazha1 and Anton Buzdin1,2,3,4 From 11th International Young Scientists School “Systems Biology and Bioinformatics” – SBB-2019 Novosibirsk, Russia. 24-28 June 2019
Abstract Background: Machine learning (ML) methods still have limited applicability in personalized oncology due to low numbers of available clinically annotated molecular profiles. This doesn’t allow sufficient training of ML classifiers that could be used for improving molecular diagnostics. Methods: We reviewed published datasets of high throughput gene expression profiles corresponding to cancer patients with known responses on chemotherapy treatments. We browsed Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and Tumor Alterations Relevant for GEnomics-driven Therapy (TARGET) repositories. Results: We identified data collections suitable to build ML models for predicting responses on certain chemotherapeutic schemes. We identified 26 datasets, ranging from 41 till 508 cases per dataset. All the datasets identified were checked for ML applicability and robustness with leave-one-out cross validation. Twenty-three datasets were found suitable for using ML that had balanced numbers of treatment responder and non-responder cases. Conclusions: We collected a database of gene expression profiles associated with clinical responses on chemotherapy for 2786 individual cancer cases. Among them seven datasets included RNA sequencing data (for 645 cases) and the others – microarray expression profiles. The cases represented breast cancer, lung cancer, low-grade glioma, endothelial carcinoma, multiple myeloma, adult leukemia, pediatric leukemia and kidney tumors. Chemotherapeutics included taxanes, bortezomib, vincristine, trastuzumab, letrozole, tipifarnib, temozolomide, busulfan and cyclophosphamide. Keywords: Machine learning, Transcriptomics, Gene expression, RNA sequencing, Microarrays, Molecular diagnostics, Biomarkers detection, Cancer, Clinical oncology, Personalized medicine, Chemotherapy
* Correspondence: [email protected] 1 Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA 91788, USA 2 Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Oblast 141701, Russia Full list of author information is available at the end of the article © The Author(s). 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the a
Data Loading...