Boundless Bio, Inc. (BB), and serve as consultants. V.B. is a co-founder, and has equity interest in Boundless Bio, inc. (BB) and Digital Proteomics, LLC (DP), and receives income from DP. The terms of this arrangement have been reviewed and approved by the University of California, San Diego in accordance with its conflict of interest policies. BB and DP were not involved in the research presented here. Data Availability. Whole genome-, RNA-, ATAC-, MNase-, ChIP-, PLAC-Seq data are deposited in the NCBI Sequence Read Archive (BioProject: PRJNA506071). The source data files of the pixel quantification of ATAC-see on metaphase chromosome spread images to create Extended Data Figure 7d are available on Figshare (
High-throughput sequencing of small RNAs (sRNA-seq) is a popular method used to discover and annotate microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). One of the key steps in sRNA-seq data analysis is alignment to a reference genome. sRNA-seq libraries often have a high proportion of reads that align to multiple genomic locations, which makes determining their true origins difficult. Commonly used sRNA-seq alignment methods result in either very low precision (choosing an alignment at random), or sensitivity (ignoring multi-mapping reads). Here, we describe and test an sRNA-seq alignment strategy that uses local genomic context to guide decisions on proper placements of multi-mapped sRNA-seq reads. Tests using simulated sRNA-seq data demonstrated that this local-weighting method outperforms other alignment strategies using three different plant genomes. Experimental analyses with real sRNA-seq data also indicate superior performance of local-weighting methods for both plant miRNAs and heterochromatic siRNAs. The local-weighting methods we have developed are implemented as part of the sRNA-seq analysis program ShortStack, which is freely available under a general public license. Improved genome alignments of sRNA-seq data should increase the quality of downstream analyses and genome annotation efforts.
Twenty-one-nucleotide microRNAs (miRNAs) and 24-nucleotide Pol IV-dependent small interfering RNAs (p4-siRNAs) are the most abundant types of small RNAs in angiosperms. Some miRNAs are well conserved among different plant lineages, whereas others are less conserved, and it is not clear whether less-conserved miRNAs have the same functionality as the well conserved ones. p4-siRNAs are broadly produced in the Arabidopsis genome, sometimes from active hot spot loci, but it is unknown whether individual p4-siRNA hot spots are retained as hot spots between plant species. In this study, we compare small RNAs in two closely related species (Arabidopsis thaliana and Arabidopsis lyrata) and find that lessconserved miRNAs have high rates of divergence in MIRNA hairpin structures, mature miRNA sequences, and targetcomplementary sites in the other species. The fidelity of miRNA biogenesis from many less-conserved MIRNA hairpins frequently deteriorates in the sister species relative to the species of first discovery. We also observe that p4-siRNA occupied loci have a slight tendency to be retained as p4-siRNA loci between species, but the most active A. lyrata p4-siRNA hot spots are generally not syntenic to the most active p4-siRNA hot spots of A. thaliana. Altogether, our findings indicate that many MIRNAs and most p4-siRNA hot spots are rapidly changing and evolutionarily transient within the Arabidopsis genus.
Dodders (Cuscuta spp.) are obligate parasitic plants that obtain water and nutrients from the stems of host plants via specialized feeding structures called haustoria. Dodder haustoria facilitate bidirectional movement of viruses, proteins and mRNAs between host and parasite, but the functional effects of these movements are not known. Here we show that Cuscuta campestris haustoria accumulate high levels of many novel microRNAs (miRNAs) while parasitizing Arabidopsis thaliana. Many of these miRNAs are 22 nucleotides in length. Plant miRNAs of this length are uncommon, and are associated with amplification of target silencing through secondary short interfering RNA (siRNA) production. Several A. thaliana mRNAs are targeted by 22-nucleotide C. campestris miRNAs during parasitism, resulting in mRNA cleavage, secondary siRNA production, and decreased mRNA accumulation. Hosts with mutations in two of the loci that encode target mRNAs supported significantly higher growth of C. campestris. The same miRNAs that are expressed and active when C. campestris parasitizes A. thaliana are also expressed and active when it infects Nicotiana benthamiana. Homologues of target mRNAs from many other plant species also contain the predicted target sites for the induced C. campestris miRNAs. These data show that C. campestris miRNAs act as trans-species regulators of host-gene expression, and suggest that they may act as virulence factors during parasitism.
Endogenous 24 nt short interfering RNAs (siRNAs), derived mostly from intergenic and repetitive genomic regions, constitute a major class of endogenous small RNAs in flowering plants. Accumulation of Arabidopsis thaliana 24 nt siRNAs requires the Dicer family member DCL3, and clear homologs of DCL3 exist in both flowering and non-flowering plants. However, the absence of a conspicuous 24 nt peak in the total RNA populations of several non-flowering plants has raised the question of whether this class of siRNAs might, in contrast to the ancient 21 nt microRNAs (miRNAs) and 21–22 nt trans-acting siRNAs (tasiRNAs), be an angiosperm-specific innovation. Analysis of non-miRNA, non-tasiRNA hotspots of small RNA production within the genome of the moss Physcomitrella patens revealed multiple loci that consistently produced a mixture of 21–24 nt siRNAs with a peak at 23 nt. These Pp23SR loci were significantly enriched in transposon content, depleted in overlap with annotated genes, and typified by dense concentrations of the 5-methyl cytosine (5 mC) DNA modification. Deep sequencing of small RNAs from two independent Ppdcl3 mutants showed that the P. patens DCL3 homolog is required for the accumulation of 22–24 nt siRNAs, but not 21 nt siRNAs, at Pp23SR loci. The 21 nt component of Pp23SR-derived siRNAs was also unaffected by a mutation in the RNA-dependent RNA polymerase mutant Pprdr6. Transcriptome-wide, Ppdcl3 mutants failed to accumulate 22–24 nt small RNAs from repetitive regions while transcripts from two abundant families of long terminal repeat (LTR) retrotransposon-associated reverse transcriptases were up-regulated. Ppdcl3 mutants also displayed an acceleration of leafy gametophore production, suggesting that repetitive siRNAs may play a role in the development of P. patens. We conclude that intergenic/repeat-derived siRNAs are indeed a broadly conserved, distinct class of small regulatory RNAs within land plants.
microRNA156 (miR156) affects developmental timing in flowering plants. miR156 and its target relationships with members of the SQUAMOSA PROMOTER BINDING PROTEIN-LIKE (SPL) gene family appear universally conserved in land plants, but the specific functions of miR156 outside of flowering plants are unknown. We find that miR156 promotes a developmental change from young filamentous protonemata to leafy gametophores in the moss Physcomitrella patens, opposite to its role as an inhibitor of development in flowering plants. P. patens miR156 also influences accumulation of trans-acting small interfering RNAs (tasiRNAs) dependent upon a second ancient microRNA, miR390. Both miR156 and miR390 directly target a single major tasiRNA primary transcript. Inhibition of miR156 function causes increased miR390-triggered tasiRNA accumulation and decreased accumulation of tasiRNA targets. Overexpression of miR390 also caused a slower formation of gametophores, elevated miR390-triggered tasiRNA accumulation, and reduced level of tasiRNA targets. We conclude that a gene regulatory network controlled by miR156, miR390, and their targets controls developmental change in P. patens. The broad outlines and regulatory logic of this network are conserved in flowering plants, albeit with some modifications. Partially conserved small RNA networks thus influence developmental timing in plants with radically different body plans.
First paragraphDodders (Cuscuta spp.) are obligate parasitic plants that obtain water and nutrients from the stems of host plants via specialized feeding structures called haustoria. Dodder haustoria facilitate bi-directional movement of viruses, proteins, and mRNAs between host and parasite1, but the functional effects of these movements are not clear. Here we show that C. campestris haustoria accumulate high levels of many novel microRNAs (miRNAs) while parasitizing Arabidopsis thaliana hosts. Many of these miRNAs are 22 nts long, a usually rare size of plant miRNA associated with amplification of target silencing through secondary small interfering RNA (siRNA) production2. Several A. thaliana mRNAs are targeted by C. campestris 22 nt miRNAs during parasitism, resulting in mRNA cleavage, secondary siRNA production, and decreased mRNA accumulation levels. Hosts with mutations in two of the targets supported significantly higher growth of C. campestris. Homologs of target mRNAs from diverse plants also have predicted target sites to induced C. campestris miRNAs, and the same miRNAs are expressed and active against host targets when C. campestris parasitizes a different host, Nicotiana benthamiana. These data show that C. campestris miRNAs act as trans-species regulators of host gene expression, and suggest that they may act as virulence factors during parasitism.
Oncogene amplification, a major driver of cancer pathogenicity, is often mediated through focal amplification of genomic segments. Recent results implicate extrachromosomal DNA (ecDNA) as the primary driver of focal copy number amplification (fCNA)-enabling gene amplification, rapid tumor evolution, and the rewiring of regulatory circuitry. Resolving an fCNA's structure is a first step in deciphering the mechanisms of its genesis and the fCNA's subsequent biological consequences. We introduce a computational method, AmpliconReconstructor (AR), for integrating optical mapping (OM) of long DNA fragments (>150 kb) with next-generation sequencing (NGS) to resolve fCNAs at single-nucleotide resolution. AR uses an NGS-derived breakpoint graph alongside OM scaffolds to produce high-fidelity reconstructions. After validating its performance through multiple simulation strategies, AR reconstructed fCNAs in seven cancer cell lines to reveal the complex architecture of ecDNA, a breakage-fusion-bridge and other complex rearrangements. By reconstructing the rearrangement signatures associated with an fCNA's generative mechanism, AR enables a more thorough understanding of the origins of fCNAs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.