Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sangerbased bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.
The life cycle of higher plants alternates between a haploid gametophytic generation and a diploid sporophytic generation. The female and male gametes formed during the gametophytic phase fuse during fertilization to generate the sporophytic phase of the life cycle. Most sexually reproducing diploid plants undergo double fertilization, during which the egg cell and the homodiploid central cell are each fertilized by a sperm cell and give rise to the embryo and the triploid endosperm, respectively (Grossniklaus and Schneitz 1998).In analogy to the Greek myth in which the priestess Medea killed her children Pheres and Meidos (Euripides 431 BC; Wolf 1996), a maternal effect mutant identified in a screen for gametophytic mutants in Arabidopsis thaliana was named medea (mea; . Additional alleles of the MEA gene, termed FER-TILIZATION INDEPENDENT SEED DEVELOPMENT (FIS1), as well as mutations at two other loci, FIS2 and FERTILISATION INDEPENDENT ENDOSPERM (FIE or FIS3), were identified in genetic screens for mutants displaying seed development in the absence of fertilization (Ohad et al. 1996;Chaudhury et al. 1997). The three fisclass mutants show a gametophytic maternal effect: all seeds derived from a mutant female gametophyte (50% in a heterozygote) abort irrespective of the paternal allele. Early development of fis embryos is morphologically indistinguishable from that of wild-type siblings. However, fis embryogenesis is delayed after the globular stage and eventually arrests with oversized heart-shaped embryos surrounded by an abnormally proliferated en-
Camelina sativa is an oilseed with desirable agronomic and oil-quality attributes for a viable industrial oil platform crop. Here we generate the first chromosome-scale high-quality reference genome sequence for C. sativa and annotated 89,418 protein-coding genes, representing a whole-genome triplication event relative to the crucifer model Arabidopsis thaliana. C. sativa represents the first crop species to be sequenced from lineage I of the Brassicaceae. The well-preserved hexaploid genome structure of C. sativa surprisingly mirrors those of economically important amphidiploid Brassica crop species from lineage II as well as wheat and cotton. The three genomes of C. sativa show no evidence of fractionation bias and limited expression-level bias, both characteristics commonly associated with polyploid evolution. The highly undifferentiated polyploid genome of C. sativa presents significant consequences for breeding and genetic manipulation of this industrial oil crop.
Genomic imprinting is an epigenetic phenomenon leading to parent-of-origin specific differential expression of maternally and paternally inherited alleles. In plants, genomic imprinting has mainly been observed in the endosperm, an ephemeral triploid tissue derived after fertilization of the diploid central cell with a haploid sperm cell. In an effort to identify novel imprinted genes in Arabidopsis thaliana, we generated deep sequencing RNA profiles of F1 hybrid seeds derived after reciprocal crosses of Arabidopsis Col-0 and Bur-0 accessions. Using polymorphic sites to quantify allele-specific expression levels, we could identify more than 60 genes with potential parent-of-origin specific expression. By analyzing the distribution of DNA methylation and epigenetic marks established by Polycomb group (PcG) proteins using publicly available datasets, we suggest that for maternally expressed genes (MEGs) repression of the paternally inherited alleles largely depends on DNA methylation or PcG-mediated repression, whereas repression of the maternal alleles of paternally expressed genes (PEGs) predominantly depends on PcG proteins. While maternal alleles of MEGs are also targeted by PcG proteins, such targeting does not cause complete repression. Candidate MEGs and PEGs are enriched for cis-proximal transposons, suggesting that transposons might be a driving force for the evolution of imprinted genes in Arabidopsis. In addition, we find that MEGs and PEGs are significantly faster evolving when compared to other genes in the genome. In contrast to the predominant location of mammalian imprinted genes in clusters, cluster formation was only detected for few MEGs and PEGs, suggesting that clustering is not a major requirement for imprinted gene regulation in Arabidopsis.
Small non-coding miRNAs (microRNAs) are emerging as key factors involved in cancer at all stages ranging from initiation to metastasis. MIRN21 is an miRNA gene that codes for the miR-21 miRNA which has been found to be overexpressed in many tumour samples where it has been analysed. Whereas consistent overexpression of miR-21 in tumours could be suggestive of functional effects of miR-21 in cancer, more in-depth functional studies with miR-21 are demonstrating that mir-21 displays oncogenic activity and can be classed as an oncomir. Extensive efforts are underway to identify the downstream genes and gene networks regulated by miR-21 and to identify the upstream factors that are regulating expression of miR-21. Even though miR-21 is one of the most intensively studied miRNAs, for all miRNAs, our understanding of miRNA signalling pathways is currently in its early stages. The unravelling of such RNA signalling pathways and networks will be key to understanding the role that dysregulated miRNA functioning can play in oncogenic processes.
In higher plants, seed development requires maternal gene activity in the haploid (gametophytic) as well as diploid (sporophytic) tissues of the developing ovule. The Arabidopsis thaliana gene MEDEA (MEA) encodes a SET-domain protein of the Polycomb group that regulates cell proliferation by exerting a gametophytic maternal control during seed development. Seeds derived from female gametocytes (embryo sacs) carrying a mutant mea allele abort and exhibit cell proliferation defects in both the embryo and the endosperm. In this study we show that the mea mutation affects an imprinted gene expressed maternally in cells of the female gametophyte and after fertilization only from maternally inherited MEA alleles. Paternally inherited MEA alleles are transcriptionally silent in both the young embryo and endosperm. Mutations at the decrease in DNA methylation1 (ddm1) locus are able to rescue mea seeds by functionally reactivating paternally inherited MEA alleles during seed development. Rescued seeds are larger than the wild type and exhibit some of the abnormalities found in aborting mea seeds. Our results indicate that the maintenance of the genomic imprint at the mea locus requires zygotic DDM1 activity. Because DDM1 encodes a putative chromatin remodeling factor, chromatin structure is likely to be interrelated with genomic imprinting in Arabidopsis.
BackgroundAll sequenced genomes contain a proportion of lineage-specific genes, which exhibit no sequence similarity to any genes outside the lineage. Despite their prevalence, the origins and functions of most lineage-specific genes remain largely unknown. As more genomes are sequenced opportunities for understanding evolutionary origins and functions of lineage-specific genes are increasing.ResultsThis study provides a comprehensive analysis of the origins of lineage-specific genes (LSGs) in Arabidopsis thaliana that are restricted to the Brassicaceae family. In this study, lineage-specific genes within the nuclear (1761 genes) and mitochondrial (28 genes) genomes are identified. The evolutionary origins of two thirds of the lineage-specific genes within the Arabidopsis thaliana genome are also identified. Almost a quarter of lineage-specific genes originate from non-lineage-specific paralogs, while the origins of ~10% of lineage-specific genes are partly derived from DNA exapted from transposable elements (twice the proportion observed for non-lineage-specific genes). Lineage-specific genes are also enriched in genes that have overlapping CDS, which is consistent with such novel genes arising from overprinting. Over half of the subset of the 958 lineage-specific genes found only in Arabidopsis thaliana have alignments to intergenic regions in Arabidopsis lyrata, consistent with either de novo origination or differential gene loss and retention, with both evolutionary scenarios explaining the lineage-specific status of these genes. A smaller number of lineage-specific genes with an incomplete open reading frame across different Arabidopsis thaliana accessions are further identified as accession-specific genes, most likely of recent origin in Arabidopsis thaliana. Putative de novo origination for two of the Arabidopsis thaliana-only genes is identified via additional sequencing across accessions of Arabidopsis thaliana and closely related sister species lineages. We demonstrate that lineage-specific genes have high tissue specificity and low expression levels across multiple tissues and developmental stages. Finally, stress responsiveness is identified as a distinct feature of Brassicaceae-specific genes; where these LSGs are enriched for genes responsive to a wide range of abiotic stresses.ConclusionImproving our understanding of the origins of lineage-specific genes is key to gaining insights regarding how novel genes can arise and acquire functionality in different lineages. This study comprehensively identifies all of the Brassicaceae-specific genes in Arabidopsis thaliana and identifies how the majority of such lineage-specific genes have arisen. The analysis allows the relative importance (and prevalence) of different evolutionary routes to the genesis of novel ORFs within lineages to be assessed. Insights regarding the functional roles of lineage-specific genes are further advanced through identification of enrichment for stress responsiveness in lineage-specific genes, highlighting their likely importanc...
BackgroundDomestication of the now-extinct wild aurochs, Bos primigenius, gave rise to the two major domestic extant cattle taxa, B. taurus and B. indicus. While previous genetic studies have shed some light on the evolutionary relationships between European aurochs and modern cattle, important questions remain unanswered, including the phylogenetic status of aurochs, whether gene flow from aurochs into early domestic populations occurred, and which genomic regions were subject to selection processes during and after domestication. Here, we address these questions using whole-genome sequencing data generated from an approximately 6,750-year-old British aurochs bone and genome sequence data from 81 additional cattle plus genome-wide single nucleotide polymorphism data from a diverse panel of 1,225 modern animals.ResultsPhylogenomic analyses place the aurochs as a distinct outgroup to the domestic B. taurus lineage, supporting the predominant Near Eastern origin of European cattle. Conversely, traditional British and Irish breeds share more genetic variants with this aurochs specimen than other European populations, supporting localized gene flow from aurochs into the ancestors of modern British and Irish cattle, perhaps through purposeful restocking by early herders in Britain. Finally, the functions of genes showing evidence for positive selection in B. taurus are enriched for neurobiology, growth, metabolism and immunobiology, suggesting that these biological processes have been important in the domestication of cattle.ConclusionsThis work provides important new information regarding the origins and functional evolution of modern cattle, revealing that the interface between early European domestic populations and wild aurochs was significantly more complex than previously thought.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-015-0790-2) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.