Genome- wide structural and functional variant discovery of rice landraces using genotyping by sequencing

  • PDF / 1,651,644 Bytes
  • 12 Pages / 595.276 x 790.866 pts Page_size
  • 68 Downloads / 205 Views

DOWNLOAD

REPORT


ORIGINAL ARTICLE

Genome‑ wide structural and functional variant discovery of rice landraces using genotyping by sequencing Smitha Kunhiraman Vasumathy1 · Maya Peringottillam1 · Krishna T. Sundaram2 · S. Hari Krishna Kumar1 · Manickavelu Alagu1  Received: 7 June 2020 / Accepted: 28 August 2020 © Springer Nature B.V. 2020

Abstract Rice landraces are vital genetic resources for agronomic and quality traits but the undeniable collection of Kerala landraces remains poorly delineated. To effectively conserve, manage, and use these resources, understanding the genomic structure of germplasm is essential. Genotyping by sequencing (GBS) enables identification of an immense number of single nucleotide polymorphism (SNP) and insertion deletion (InDel) from 96 rice germplasm. In the present study, a total of 16.9 × 107 reads were generated, and among that 16.3 × 107 reads were mapped to the indica reference genome. Exploring GBS data unfolded a wide genomic variations including 82,59,639 SNPs and 1,07,140 Indels. Both neighbor-joining tree and principal coordinate analysis with InDel markers revealed the selected germplasm in this study as highly diverse in structure. We assembled unmapped reads which were further employed for gene ontology analysis. These unmapped sequences that are generally expelled from subsequent studies of GBS data analysis may exist as an unexplored resort for several novel significant biological findings. The discovery of SNPs from the haplotyping results of GS3 and GIF1 genes provided insight into marker- assisted selection based on grain size and yield and can be utilized for rice yield improvement. To our knowledge, this is the first report on structural variation analysis using the GBS platform in rice landraces collected from Kerala. Genomic information from this study endows with valuable resources for perceptive rice landrace structure and can also facilitate sequencing-based molecular breeding. Keywords  Rice landraces · Genotyping by sequencing · SNP · InDel · Unmapped sequences · Gene ontology

Introduction Rice (Oryza sativa L.) is grown in more than a 100 countries, with a production of 700 million tons per annum from a total harvested area of about 158 million hectares globally. As it is one of the major diets consumed by more than half of the people in the world, its production level has to be increased considerably to feed the mounting population. Under the Electronic supplementary material  The online version of this article (https​://doi.org/10.1007/s1103​3-020-05794​-9) contains supplementary material, which is available to authorized users. * Manickavelu Alagu [email protected] 1



Department of Genomic Science, Central University of Kerala, Kasaragod, Kerala 671316, India



South Asia hub, ICRISAT Campus, International Rice Research Institute, Secundarabad, Telangana, India

2

sway of farmer’s desire and environmental circumstances, the rice originated in South East Asia expanded into 88,681 different varieties within which 1,171 are wild races, 55,615 are landraces, an