Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.
Common bean (Phaseolus vulgaris L.) is the most important grain legume for human consumption and has a role in sustainable agriculture owing to its ability to fix atmospheric nitrogen. We assembled 473 Mb of the 587-Mb genome and genetically anchored 98% of this sequence in 11 chromosome-scale pseudomolecules. We compared the genome for the common bean against the soybean genome to find changes in soybean resulting from polyploidy. Using resequencing of 60 wild individuals and 100 landraces from the genetically differentiated Mesoamerican and Andean gene pools, we confirmed 2 independent domestications from genetic pools that diverged before human colonization. Less than 10% of the 74 Mb of sequence putatively involved in domestication was shared by the two domestication events. We identified a set of genes linked with increased leaf and seed size and combined these results with quantitative trait locus data from Mesoamerican cultivars. Genes affected by domestication may be useful for genomics-enabled crop improvement.
Cultivated peanut (Arachis hypogaea) is an allotetraploid with closely related subgenomes of a total size of ~2.7 Gb. This makes the assembly of chromosomal pseudomolecules very challenging. As a foundation to understanding the genome of cultivated peanut, we report the genome sequences of its diploid ancestors (Arachis duranensis and Arachis ipaensis). We show that these genomes are similar to cultivated peanut's A and B subgenomes and use them to identify candidate disease resistance genes, to guide tetraploid transcript assemblies and to detect genetic exchange between cultivated peanut's subgenomes. On the basis of remarkably high DNA identity of the A. ipaensis genome and the B subgenome of cultivated peanut and biogeographic evidence, we conclude that A. ipaensis may be a direct descendant of the same population that contributed the B subgenome to cultivated peanut. A r t i c l e s npg © 2016 Nature America, Inc. All rights reserved.Nature GeNetics VOLUME 48 | NUMBER 4 | APRIL 2016 4 3 9 subgenomes of A. hypogaea. Progeny are vigorous, phenotypically normal and fertile and showed lower segregation distortion 16,17 than has been observed for some populations derived from A. hypogaea intraspecific crosses [18][19][20][21] . Therefore, as a first step to characterizing the genome of cultivated peanut, we sequenced and analyzed the genomes of the two diploid ancestors of cultivated peanut. RESULTS Sequencing and assembly of the diploid A and B genomesConsidering that A. duranensis V14167 and A. ipaensis K30076 are likely good representatives of the ancestral species of A. hypogaea, we sequenced their genomes. After filtering, the data generated from the seven paired-end libraries corresponded to an estimated 154× and 163× base-pair coverage for A. duranensis and A. ipaensis, respectively (Supplementary Tables 1-6). The total assembly sizes were 1,211 and 1,512 Mb for A. duranensis and A. ipaensis, respectively, of which 1,081 and 1,371 Mb were represented in scaffolds of 10 kb or greater in size (Supplementary Table 7). Ultradense genetic maps were generated through genotyping by sequencing (GBS) of two diploid recombinant inbred line (RIL) populations (Supplementary Data Set 1). SNPs within scaffolds were used to validate the assemblies and confirmed their high quality; 190 of 1,297 initial scaffolds of A. duranensis and 49 of 353 initial scaffolds of A. ipaensis were identified as chimeric, on the basis of the presence of diagnostic population-wide switches in genotype calls occurring at the point of misjoin. Chimeric scaffolds were split, and their components were remapped. Thus, approximate chromosomal placements were obtained for 1,692 and 459 genetically verified scaffolds, respectively. Conventional molecular marker maps (Supplementary Data Set 2) and syntenic inferences were then used to refine the ordering of scaffolds within the initial genetic bins. Generally, agreement was good for maps in euchromatic arms and poorer in pericentromeric regions (although one map 22 showed large inversions in two lin...
Legumes and many nonleguminous plants enter symbiotic interactions with microbes, and it is poorly understood how host plants respond to promote beneficial, symbiotic microbial interactions while suppressing those that are deleterious or pathogenic. Trans-acting siRNAs (tasiRNAs) negatively regulate target transcripts and are characterized by siRNAs spaced in 21-nucleotide (nt) ''phased'' intervals, a pattern formed by DICER-LIKE 4 (DCL4) processing. A search for phased siRNAs (phasiRNAs) found at least 114 Medicago loci, the majority of which were defense-related NB-LRR-encoding genes. We identified three highly abundant 22-nt microRNA (miRNA) families that target conserved domains in these NB-LRRs and trigger the production of trans-acting siRNAs. High levels of small RNAs were matched to >60% of all~540 encoded Medicago NB-LRRs; in the potato, a model for mycorrhizal interactions, phasiRNAs were also produced from NB-LRRs. DCL2 and SGS3 transcripts were also cleaved by these 22-nt miRNAs, generating phasiRNAs, suggesting synchronization between silencing and pathogen defense pathways. In addition, a new example of apparent ''two-hit'' phasiRNA processing was identified. Our data reveal complex tasiRNA-based regulation of NB-LRRs that potentially evolved to facilitate symbiotic interactions and demonstrate miRNAs as master regulators of a large gene family via the targeting of highly conserved, protein-coding motifs, a new paradigm for miRNA function.
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, has raised concerns about the reliability of this technology. The MicroArray Quality Control (MAQC) project was initiated to address these concerns, as well as other performance and data analysis issues. Expression data on four titration pools from two distinct reference RNA samples were generated at multiple test sites using a variety of microarray-based and alternative technology platforms. Here we describe the experimental design and probe mapping efforts behind the MAQC project. We show intraplatform consistency across test sites as well as a high level of interplatform concordance in terms of genes identified as differentially expressed. This study provides a resource that represents an important first step toward establishing a framework for the use of microarrays in clinical and regulatory settings.
Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sangerbased bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.