Hevea brasiliensis (rubber tree) is a large tree species of the Euphorbiaceae family with inestimable economic importance. Rubber tree breeding programs currently aim to improve growth and production, and the use of early genotype selection technologies can accelerate such processes, mainly with the incorporation of genomic tools, such as marker-assisted selection (MAS). However, few quantitative trait loci (QTLs) have been used successfully in MAS for complex characteristics. Recent research shows the efficiency of genome-wide association studies (GWAS) for locating QTL regions in different populations. In this way, the integration of GWAS, RNA-sequencing (RNA-Seq) methodologies, coexpression networks and enzyme networks can provide a better understanding of the molecular relationships involved in the definition of the phenotypes of interest, supplying research support for the development of appropriate genomic based strategies for breeding. In this context, this work presents the potential of using combined multiomics to decipher the mechanisms of genotype and phenotype associations involved in the growth of rubber trees. Using GWAS from a genotyping-by-sequencing (GBS) Hevea population, we were able to identify molecular markers in QTL regions with a main effect on rubber tree plant growth under constant water stress. The underlying genes were evaluated and incorporated into a gene coexpression network modelled with an assembled RNA-Seq-based transcriptome of the species, where novel gene relationships were estimated and evaluated through in silico methodologies, including an estimated enzymatic network. From all these analyses, we were able to estimate not only the main genes involved in defining the phenotype but also the interactions between a core of genes related to rubber tree growth at the transcriptional and translational levels. This work was the first to integrate multiomics analysis into the in-depth investigation of rubber tree plant growth, producing useful data for future genetic studies in the species and enhancing the efficiency of the species improvement programs.
Poaceae, among the most abundant plant families, includes many economically important polyploid species, such as forage grasses and sugarcane (Saccharum spp.). These species have elevated genomic complexities and limited genetic resources, hindering the application of marker-assisted selection strategies. Currently, the most promising approach for increasing genetic gains in plant breeding is genomic selection. However, due to the polyploidy nature of these polyploid species, more accurate models for incorporating genomic selection into breeding schemes are needed. This study aims to develop a machine learning method by using a joint learning approach to predict complex traits from genotypic data. Biparental populations of sugarcane and two species of forage grasses (Urochloa decumbens, Megathyrsus maximus) were genotyped, and several quantitative traits were measured. High-quality markers were used to predict several traits in different cross-validation scenarios. By combining classification and regression strategies, we developed a predictive system with promising results. Compared with traditional genomic prediction methods, the proposed strategy achieved accuracy improvements exceeding 50%. Our results suggest that the developed methodology could be implemented in breeding programs, helping reduce breeding cycles and increase genetic gains.
Plant stomata are essential structures (pores) that control the exchange of gases between plant leaves and the atmosphere, and also they influence plant adaptation to climate through photosynthesis and transpiration stream. Many works in literature aim for a better understanding of these structures and their role in the evolution process and the behavior of plants. Although stomata studies in dicots species have advanced considerably in the past years, even there is not much knowledge about the stomata of cereal grasses. Due to the high morphological variation of stomata traits intra- and inter-species, detecting and classifying stomata automatically becomes challenging. For this reason, in this work, we propose a new system for automatic stomata classification and detection in microscope images for maize cultivars based on transfer learning strategy of different deep convolution neural netwoks (DCNN). Our performed experiments show that our system achieves an approximated accuracy of 97.1% in identifying stomata regions using classifiers based on deep learning features, which figures out as a nearly perfect classification system. As the stomata are responsible for several plant functionalities, this work represents an important advance for maize research, providing an accurate system in replacing the current manual task of categorizing these pores on microscope images. Furthermore, this system can also be a reference for studies using images from different cereal grasses.
The breeding of sugarcane, a leading sugar and energy crop, is complicated by the extremely complex sugarcane genome, which burdens research in the area and delays the development of new cultivars. One of the main viral diseases that affect this crop is sugarcane yellow leaf (SCYL), which is caused by the sugarcane yellow leaf virus (SCYLV). The most common symptom of SCYL is the yellowing of leaf midribs and blades, but asymptomatic cases are frequent. Regardless of the manifestation of SCYL, infection by SCYLV can lead to substantial yield losses, making resistance to this virus highly relevant to sugarcane breeding. However, the genetic basis of this trait has not been widely explored or explained. In this context, genome-wide association studies (GWASs) have arisen as promising tools for the identification of molecular markers associated with SCYLV resistance that can be employed in marker-assisted selection. In the present work, we performed a GWAS on sugarcane using codominant markers and genotypes of interest for breeding. A panel of 97 sugarcane genotypes inoculated with SCYLV was analyzed for SCYL symptom severity, and viral titer was estimated by reverse transcription quantitative PCR (RT-qPCR). A genotyping-by-sequencing (GBS) library was constructed for 94 individuals of this population, enabling the identification of 38,710 SNPs and 32,178 indels with information on allele proportion (AP) and position on the Saccharum spontaneum genome. For association analyses, several combinations of population structure and kinship were tested to reduce model inflation, and diverse marker-trait association mixed models were employed. We identified 35 markers significantly associated with SCYL symptom severity and 22 markers strongly associated with SCYLV titer that can be applied in breeding programs upon validation. By aligning the sequences flanking these markers with their coding sequences in several plant species, we annotated the functions of 7 genes. The possible involvement of these candidates in the response to SCYLV infection is discussed.
Trichoderma harzianum, whose gene expression is tightly controlled by the transcription factors (TFs) XYR1 and CRE1, is a potential candidate for hydrolytic enzyme production. Here, we performed a network analysis of T. harzianum IOC-3844 and T. harzianum CBMAI-0179 to explore how the regulation of these TFs varies between these strains. In addition, we explored the evolutionary relationships of XYR1 and CRE1 protein sequences among Trichoderma spp. The results of the T. harzianum strains were compared with those of Trichoderma atroviride CBMAI-0020, a mycoparasitic species. Although transcripts encoding carbohydrate-active enzymes (CAZymes), TFs, transporters, and proteins with unknown functions were coexpressed with cre1 or xyr1, other proteins indirectly related to cellulose degradation were identified. The enriched GO terms describing the transcripts of these groups differed across all strains, and several metabolic pathways with high similarity between both regulators but strain-specific differences were identified. In addition, the CRE1 and XYR1 subnetworks presented different topology profiles in each strain, likely indicating differences in the influences of these regulators according to the fungi. The hubs of the cre1 and xyr1 groups included transcripts not yet characterized or described as being related to cellulose degradation. The first-neighbor analyses confirmed the results of the profile of the coexpressed transcripts in cre1 and xyr1. The analyses of the shortest paths revealed that CAZymes upregulated under cellulose degradation conditions are most closely related to both regulators, and new targets between such signaling pathways were discovered. Although the evaluated T. harzianum strains are phylogenetically close and their amino acid sequences related to XYR1 and CRE1 are very similar, the set of transcripts related to xyr1 and cre1 differed, suggesting that each T. harzianum strain used a specific regulation strategy for cellulose degradation. More interestingly, our findings may suggest that XYR1 and CRE1 indirectly regulate genes encoding proteins related to cellulose degradation in the evaluated T. harzianum strains. An improved understanding of the basic biology of fungi during the cellulose degradation process can contribute to the use of their enzymes in several biotechnological applications and pave the way for further studies on the differences across strains of the same species.
Artificial hybridization plays a fundamental role in plant breeding programs since it generates new genotypic combinations that can result in desirable phenotypes. Depending on the species and mode of reproduction, controlled crosses may be challenging, and contaminating individuals can be introduced accidentally. In this context, the identification of such contaminants is important to avoid compromising further selection cycles, as well as genetic and genomic studies. The main objective of this work was to propose an automated multivariate methodology for the detection and classification of putative contaminants, including apomictic clones (ACs), self-fertilized individuals, half-siblings (HSs), and full contaminants (FCs), in biparental polyploid progenies of tropical forage grasses. We established a pipeline to identify contaminants in genotyping-by-sequencing (GBS) data encoded as allele dosages of single nucleotide polymorphism (SNP) markers by integrating principal component analysis (PCA), genotypic analysis (GA) measures based on Mendelian segregation, and clustering analysis (CA). The combination of these methods allowed for the correct identification of all contaminants in all simulated progenies and the detection of putative contaminants in three real progenies of tropical forage grasses, providing an easy and promising methodology for the identification of contaminants in biparental progenies of tetraploid and hexaploid species. The proposed pipeline was made available through the polyCID Shiny app and can be easily coupled with traditional genetic approaches, such as linkage map construction, thereby increasing the efficiency of breeding programs.
The protein kinase (PK) superfamily constitutes one of the largest and most conserved protein families in eukaryotic genomes, comprising core components of signaling pathways in cell regulation. Despite its remarkable relevance, only a few kinase families have been studied in Hevea brasiliensis. A comprehensive characterization and global expression analysis of the PK superfamily, however, is currently lacking. In this study, with the aim of providing novel inferences about the mechanisms associated with the stress response developed by PKs and retained throughout evolution, we identified and characterized the entire set of PKs, also known as the kinome, present in the Hevea genome. Different RNA-sequencing datasets were employed to identify tissue-specific expression patterns and potential correspondences between different rubber tree genotypes. In addition, coexpression networks under several abiotic stress conditions, such as cold, drought and latex overexploitation, were employed to elucidate associations between families and tissues/stresses. A total of 1,809 PK genes were identified using the current reference genome assembly at the scaffold level, and 1,379 PK genes were identified using the latest chromosome-level assembly and combined into a single set of 2,842 PKs. These proteins were further classified into 20 different groups and 122 families, exhibiting high compositional similarities among family members and with two phylogenetically close species Manihot esculenta and Ricinus communis. Through the joint investigation of tandemly duplicated kinases, transposable elements, gene expression patterns, and coexpression events, we provided insights into the understanding of the cell regulation mechanisms in response to several conditions, which can often lead to a significant reduction in rubber yield.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.