BackgroundResearch on orphan crops is often hindered by a lack of genomic resources. With the advent of affordable sequencing technologies, genotyping an entire genome or, for large-genome species, a representative fraction of the genome has become feasible for any crop. Nevertheless, most genotyping-by-sequencing (GBS) methods are geared towards obtaining large numbers of markers at low sequence depth, which excludes their application in heterozygous individuals. Furthermore, bioinformatics pipelines often lack the flexibility to deal with paired-end reads or to be applied in polyploid species.ResultsUGbS-Flex combines publicly available software with in-house python and perl scripts to efficiently call SNPs from genotyping-by-sequencing reads irrespective of the species’ ploidy level, breeding system and availability of a reference genome. Noteworthy features of the UGbS-Flex pipeline are an ability to use paired-end reads as input, an effective approach to cluster reads across samples with enhanced outputs, and maximization of SNP calling. We demonstrate use of the pipeline for the identification of several thousand high-confidence SNPs with high representation across samples in an F3-derived F2 population in the allotetraploid finger millet. Robust high-density genetic maps were constructed using the time-tested mapping program MAPMAKER which we upgraded to run efficiently and in a semi-automated manner in a Windows Command Prompt Environment. We exploited comparative GBS with one of the diploid ancestors of finger millet to assign linkage groups to subgenomes and demonstrate the presence of chromosomal rearrangements.ConclusionsThe paper combines GBS protocol modifications, a novel flexible GBS analysis pipeline, UGbS-Flex, recommendations to maximize SNP identification, updated genetic mapping software, and the first high-density maps of finger millet. The modules used in the UGbS-Flex pipeline and for genetic mapping were applied to finger millet, an allotetraploid selfing species without a reference genome, as a case study. The UGbS-Flex modules, which can be run independently, are easily transferable to species with other breeding systems or ploidy levels.Electronic supplementary materialThe online version of this article (10.1186/s12870-018-1316-3) contains supplementary material, which is available to authorized users.
Small millets are very promising agricultural entity to ensure global food security. They gained remarkable importance in agriculture due to their resilience to climatic changes and increasing demand for nutritious food and feed. The genetic variability in the core and mini-core germplasm of small millets was characterized for nutritional composition and capacity to tolerate abiotic stresses that can be infused in breeding programs. Other than the foxtail millet, availability of genomic information in small millets is far below the mark for use in marker-assisted breeding and other genetic improvement programs. The genome sequence of foxtail millet has recently triggered a plethora of post-genomic analysis and envisaged foxtail millet as a model organism for the C 4 grasses and bioenergy research. Recent developments in the next-generation sequencing technologies enabled us, with the simultaneous discovery of high-throughput markers and multiplexed genotyping of germplasm, to speedup markerassisted breeding. In this context, an in-depth analysis of the wealth of diverse germplasm resources and future perspectives of integrating genomics in genome-wide marker-trait association and breeding in small millets is worthy. ABBREVIATIONS cDNA D complementary DNA cM D centimorgan DREB D dehydration response element binding FAO D Food and Agricultural Organization ICAR D Indian Council of Agricultural Research ICRISAT D International Crops Research Institute for the Semi-Arid Tropics Mb D mega basepairs Mha D million hectares Mt D million tonnes Mya D million years ago NARS D National Agricultural Research System NCBI D National Center for Biotechnology Information NGS D next-generation sequencing PCR D polymerase chain reaction PGR D plant genetic resources SNP D single nucleotide polymorphism SSR D simple sequence repeat QTLs D quantitative trait loci
Finger millet is a key food security crop widely grown in eastern Africa, India and Nepal. Long considered a ‘poor man’s crop’, finger millet has regained attention over the past decade for its climate resilience and the nutritional qualities of its grain. To bring finger millet breeding into the 21st century, here we present the assembly and annotation of a chromosome-scale reference genome. We show that this ~1.3 million years old allotetraploid has a high level of homoeologous gene retention and lacks subgenome dominance. Population structure is mainly driven by the differential presence of large wild segments in the pericentromeric regions of several chromosomes. Trait mapping, followed by variant analysis of gene candidates, reveals that loss of purple coloration of anthers and stigma is associated with loss-of-function mutations in the finger millet orthologs of the maize R1/B1 and Arabidopsis GL3/EGL3 anthocyanin regulatory genes. Proanthocyanidin production in seed is not affected by these gene knockouts.
Retting of bast fibres requires removal of pectin, hemicellulose and other non-cellulosic materials from plant stem tissues by a complex microbial community. A microbial retting consortium with high-efficiency pectinolytic bacterial strains is effective in reducing retting-time and enhancing fibre quality. We report comprehensive genomic analyses of three bacterial strains (PJRB 1, 2 and 3) of the consortium and resolve their taxonomic status, genomic features, variations, and pan-genome dynamics. The genome sizes of the strains are ~3.8 Mb with 3729 to 4002 protein-coding genes. Detailed annotations of the protein-coding genes revealed different carbohydrate-degrading CAZy classes viz. PL1, PL9, GH28, CE8, and CE12. Phylogeny and structural features of pectate lyase proteins of PJRB strains divulge their functional uniqueness and evolutionary convergence with closely related Bacillus strains. Genome-wide prediction of genomic variations revealed 12461 to 67381 SNPs, and notably many unique SNPs were localized within the important pectin metabolism genes. The variations in the pectate lyase genes possibly contribute to their specialized pectinolytic function during the retting process. These findings encompass a strong foundation for fundamental and evolutionary studies on this unique microbial degradation of decaying plant material with immense industrial significance. These have preponderant implications in plant biomass research and food industry, and also posit application in the reclamation of water pollution from plant materials.
Flax (
Linum usitatissimum
) is a cool season crop commercially cultivated for seed oil and stem fibre production. A comprehensive characterization of the
heat shock factor
(
HSF
) candidate genes in flax can accelerate genetic improvement and adaptive breeding for high temperature stress tolerance. We report the genome-wide identification of 34 putative
HSF
genes from the flax genome, which we mapped on 14 of the 15 chromosomes. Through comparative homology analysis, we classified these genes into three broad groups, and sub-groups. The arrangement of HSF-specific protein motifs, DNA-binding domain (DBD) and hydrophobic heptad repeat (HR-A/B), and exon-intron boundaries substantiated the phylogenetic separation of these genes. Orthologous relationships and evolutionary analysis revealed that the co-evolution of the
LusHSF
genes was due to recent genome duplication events. Digital and RT-qPCR analyses provided significant evidence of the differential expression of the
LusHSF
genes in various tissues, at various developmental stages, and in response to high-temperature stress. The co-localization of diverse cis-acting elements in the promoters of the
LusHSF
genes further emphasized their regulatory roles in the abiotic stress response. We further confirmed DNA-binding sites on the LusHSF proteins and designed guide RNA sequences for gene editing with minimal off-target effects. These results will hasten functional investigations of
LusHSFs
or assist in devising genome engineering strategies to develop high-temperature stress tolerant flax cultivars.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.