Predicting protein subchloroplast locations: the 10th anniversary
- PDF / 368,086 Bytes
- 11 Pages / 612.284 x 802.205 pts Page_size
- 32 Downloads / 227 Views
Predicting protein subchloroplast locations: the 10th anniversary Jian SUN, Pu-Feng DU College of Intelligence and Computing, Tianjin University, Tianjin 300350, China c Higher Education Press 2020
Abstract Chloroplast is a type of subcellular organelle in green plants and algae. It is the main subcellular organelle for conducting photosynthetic process. The proteins, which localize within the chloroplast, are responsible for the photosynthetic process at molecular level. The chloroplast can be further divided into several compartments. Proteins in different compartments are related to different steps in the photosynthetic process. Since the molecular function of a protein is highly correlated to the exact cellular localization, pinpointing the subchloroplast location of a chloroplast protein is an important step towards the understanding of its role in the photosynthetic process. Experimental process for determining protein subchloroplast location is always costly and time consuming. Therefore, computational approaches were developed to predict the protein subchloroplast locations from the primary sequences. Over the last decades, more than a dozen studies have tried to predict protein subchloroplast locations with machine learning methods. Various sequence features and various machine learning algorithms have been introduced in this research topic. In this review, we collected the comprehensive information of all existing studies regarding the prediction of protein subchloroplast locations. We compare these studies in the aspects of benchmarking datasets, sequence features, machine learning algorithms, predictive performances, and the implementation availability. We summarized the progress and current status in this special research topic. We also try to figure out the most possible future works in predicting protein subchloroplast locations. We hope this review not only list all existing works, but also serve the readers as a useful resource for quickly grasping the big picture of this research topic. We also hope this review work can be a starting point of future methodology studies regarding the prediction of protein subchloroplast locations. Keywords subchloroplast locations, sequence features, performance measures, online services, machine learning
1
Introduction
Photosynthetic process, which deemed to be the most important biological process in this world, convert light energy to chemical energy. The chemical energy, which is stored in the form of carbohydrate molecules, can be utilized in almost every other cellular process. It is believed that the energy in the crude Received December 19, 2019; accepted April 29, 2020 E-mail: [email protected]
oil, which drives the modern civilization, is actually converted and stored by the photosynthetic process in the past billions of years. Chloroplasts, which are subcellular organelles, are responsible for conducting photosynthesis process in almost every green plant, as well as algae. A chloroplast is a type of plastid. Unlike other plastid types, such as the leuc
Data Loading...