Sorghum ranks fifth in cereal production and acreage behind maize, rice, wheat and barley (http://www.fao.org). It is cultivated in vast geographic areas in the Americas, Africa, Asia, and Oceania. Sorghum's excellent agronomic and biological properties, such as heat and drought tolerance, make it a vital grain crop in marginal land for production without competing against other major food crops [1]. With the increase of world population and the decrease of water resources, sorghum will become the preferred food crop all over the world in the future. Furthermore, sorghum is not only harvested for grain, but also often used to produce syrup, grazing and biomass production [2].
As a model organism that carries out C4 photosynthesis, sorghum was the second sequenced cereal crop after the C3 organism rice [3, 4]. The comparatively small genome of sorghum makes it a potential genetic model for the design of bioenergy crops compared with the larger and more repetitive genomes of other major C4 crops, such as maize and sugarcane. With the improvement of the reference genome (BTx623) [4, 5] and the development of sequencing technologies, studies on domestication and genetic mechanism of distinct phenotype in sorghum have been greatly accelerated [2, 6–15].
During the past decade, diverse web resources have been constructed to exhibit numerous omics data, which is beneficial for the sorghum research community (Table 1). Plant specific genome databases such as Phytozome [16] and Gramene [17], as well as the most comprehensive Genome OnLine Database (GOLD) [18] are widely used as data sources and analysis platforms for sorghum research. On the other hand, sorghum included plant secondary databases such as PIGD [19], PlanTFDB [20], DNApod [21], PceRBase [22], PtRFdb [23] and GreenPhylDB [24] have vital modules about sorghum resources. Finally, the sorghum specific secondary databases, including MOROKOSHI [25], PGSB [26], SorghumFDB [27], Sorghum QTL Atlas [28], and Sorghum Genomics, are a cluster of websites dedicated to sorghum researches. Among them, SorghumFDB is the most comprehensive sorghum specific database, which contains extensive public genomic and functional annotations data, as well as useful analysis tools. With published sorghum genome re-sequencing data of 48 accessions, we developed a sorghum SNP database (SorGSD) in 2016, providing the sorghum user community with abundant SNPs and some other resources related to sorghum genetics and genomics [29].
Table 1
Online databases for sorghum genome
Name | URL / Description | PubMed ID |
Comprehensive genome databases and analysis platforms | |
Phytozome | https://phytozome.jgi.doe.gov/ Plant genome database portal and analysis platform | [16] 22110026 |
Gramene | http://www.gramene.org/ Plant genome database portal and analysis platform | [17] 33170273 |
GOLD | https://gold.jgi.doe.gov/ Genomes online database | [18] 33152092 |
Sorghum included plant secondary databases | |
PIGD | http://pigd.ahau.edu.cn/ A database for intronless genes in Poaceae | [19] 25270086 |
PlantTFDB | http://planttfdb.gao-lab.org/ A database of plant transcription factors | [20] 27924042 |
DNApod | http://tga.nig.ac.jp/dnapod DNA polymorphism annotation database | [21] 28234924 |
PceRBase | http://bis.zju.edu.cn/pcernadb/ A database of plant competing endogenous RNA | [22] 28053167 |
PtRFdb | http://www.nipgr.res.in/PtRFdb A database for plant tRNA-derived fragments | [23] 29939244 |
GreenPhylDB | https://www.greenphyl.org/ A comparative pangenomic database for plant genomes | [24] 33237299 |
Sorghum specific secondary databases | |
MOROKOSHI | http://sorghum.riken.jp/ Sorghum transcriptome database | [25] 25505007 |
SorGSD | http://sorgsd.big.ac.cn/ Sorghum SNP database | [29] 26884811 |
PGSB | http://pgsb.helmholtz-muenchen.de/plant/sorghum/ Plant genome and systems biology | [26] 26527721 |
SorghumFDB | http://structuralbiology.cau.edu.cn/sorghum/ A database for sorghum functional genomics | [27] 27352859 |
Sorghum QTL Atlas | https://aussorgm.org.au/sorghum-qtl-atlas/ Tool for searching QTL landscape in sorghum | [28] 30343386 |
Sorghum Genomics | https://www.purdue.edu/sorghumgenomics/ Functional Gene Discovery Platform for Sorghum | N/A |
Here we announce and describe the second major release of the sorghum genome science database (SorGSD). The goal of the redesign is to construct a comprehensive database with sorghum genomic variations and phenotypes. Compared with the first version SorGSD which contains SNPs of 48 sorghum accessions, the second version provides a more extensive set of genomic variation data for both SNPs and small INDELs of 289 sorghum accessions, as well as characteristic phenotypic information and panicle pictures of critical sorghum lines. We also provide three useful tools, including ID Conversion, Homologue Search and Genome Browser, in the new release. The back-end database framework and the web interface were redesigned as a part of the Genome Variation Map at the National Genomics Data Center (NGDC) and China National Center for Bioinformation (CNCB). We hope that these data and tools are beneficial for exploring genetic variations and evolution studies of sorghum and other species. The new version SorGSD is freely accessible at https://bigd.big.ac.cn/sorgsd/.