The first evidence for the emergence of land plants (embryophytes) consists of mid-Ordovician spore tetrads (approximately 476 Myr old). The identity of the early plants that produced these spores is unclear; they are sometimes claimed to be liverworts, but there are no associated megafossils, and similar spores can be produced by a diversity of plants. Indeed, the earliest unequivocal megafossils of land plants consist of early vascular plants and various plants of uncertain affinity. Different phylogenetic analyses have identified liverworts, hornworts and bryophytes as each being the first lineage of land plants; the consensus of these conflicting topologies yields an unresolved polychotomy at the base of land plants. Here we survey 352 diverse land plants and find that three mitochondrial group II introns are present, with occasional losses, in mosses, hornworts and all major lineages of vascular plants, but are entirely absent from liverworts, green algae and all other eukaryotes. These results indicate that liverworts are the earliest land plants, with the three introns having been acquired in a common ancestor of all other land plants, and have important implications concerning the early stages of plant evolution.
A quantitative description of the relationship between protein expression levels and open reading frame nucleotide sequences (ORFs) is important for understanding natural systems, designing synthetic systems, and optimizing heterologous expression. Codon identity, mRNA secondary structure, and nucleotide composition within ORFs markedly influence expression levels. Bioinformatic analysis of ORF sequences in 816 bacterial genomes revealed that these features show distinct regional trends. To investigate their effects on protein expression, we designed 285 synthetic genes and determined corresponding expression levels in vitro using E. coli extracts. We developed a mathematical function, parameterized using this synthetic gene dataset, which enables computation of protein expression levels from ORF nucleotide sequences. In addition to its practical application in the design of heterologous expression systems, this equation provides mechanistic insight into the factors that control translation efficiency. We found that expression is strongly dependent on the presence of high AU content and low secondary structure in the ORF 5′ region. Choice of high-frequency codons contributes to a lesser extent. The 3′ terminal AU content makes modest, but detectable contributions. We present a model for the effect of these factors on the three phases of ribosomal function: initiation, elongation, and termination.
In vitro selection can be used to generate nucleic acid ligands (aptamers) to target molecules ranging in size and structure from cations to cells. However, the selection process is repetitive and time-consuming. We have automated a protocol for in vitro selection using an augmented Beckman Biomek 2000 pipetting robot. The automated selection procedure requires the integration of four devices and the optimization of four molecular biology methods, and is one of the most complex automated protocols attempted to date. Initial attempts at selection yielded robust replication parasites, but optimization of the automated selection procedure suppressed the emergence of these parasites and led to the selection of true nucleic acid ligands. Automated selection can now be used to generate nucleic acid aptamers in days rather than weeks or months.
We have created an Amino Acid-Nucleotide Interaction Database (AANT; http://aant.icmb.utexas. edu/) that categorizes all amino acid-nucleotide interactions from experimentally determined protein-nucleic acid structures, and provides users with a graphic interface for visualizing these interactions in aggregate. AANT accomplishes this by extracting individual amino acid-nucleotide interactions from structures in the Protein Data Bank, combining and superimposing these interactions into multiple structure files (e.g. 20 amino acids x 5 nucleotides) and grouping structurally similar interactions into more readily identifiable clusters. Using the Chime web browser plug-in, users can view 3D representations of the superimpositions and clusters. The unique collection and representation of data on amino acid-nucleotide interactions facilitates understanding the specificity of protein-nucleic acid interactions at a more fundamental level, and allows comparison of otherwise extremely disparate sets of structures. Moreover, by modularly representing the fundamental interactions that govern binding specificity it may prove possible to better engineer nucleic acid binding proteins.
Torque teno viruses (TTVs) are a group of viruses with small, circular DNA genomes. Members of this family are thought to ubiquitously infect humans, although causal disease associations are currently lacking. At present, there is no understanding of how infection with this diverse group of viruses is so prevalent. Using a combined computational and synthetic approach, we predict and identify miRNA-coding regions in diverse human TTVs and provide evidence for TTV miRNA production in vivo. The TTV miRNAs are transcribed by RNA polymerase II, processed by Drosha and Dicer, and are active in RISC. A TTV mutant defective for miRNA production replicates as well as wild type virus genome; demonstrating that the TTV miRNA is dispensable for genome replication in a cell culture model. We demonstrate that a recombinant TTV genome is capable of expressing an exogenous miRNA, indicating the potential utility of TTV as a small RNA vector. Gene expression profiling of host cells identifies N-myc (and STAT) interactor (NMI) as a target of a TTV miRNA. NMI transcripts are directly regulated through a binding site in the 3′UTR. SiRNA knockdown of NMI contributes to a decreased response to interferon signaling. Consistent with this, we show that a TTV miRNA mediates a decreased response to IFN and increased cellular proliferation in the presence of IFN. Thus, we add Annelloviridae to the growing list of virus families that encode miRNAs, and suggest that miRNA-mediated immune evasion can contribute to the pervasiveness associated with some of these viruses.
Reagents for proteome research must of necessity be generated by high throughput methods. Aptamers are potentially useful as reagents to identify and quantitate individual proteins, yet are currently produced for the most part by manual selection procedures. We have developed automated selection methods, but must still individually purify protein targets. Therefore, we have attempted to select aptamers against protein targets generated by in vitro transcription and translation of individual genes. In order to specifically immobilize the protein targets for selection, they are also biotinylated in vitro. As a proof of this method, we have selected aptamers against translated human U1A, a component of the nuclear spliceosome. Selected sequences demonstrated exquisite mimicry of natural binding sequences and structures. These results not only reveal a potential path to the high throughput generation of aptamers, but also yield insights into the incredible specificity of the U1A protein for its natural RNA ligands.
Facile ''writing'' of DNA fragments that encode entire gene sequences potentially has widespread applications in biological analysis and engineering. Rapid writing of open reading frames (ORFs) for expressed proteins could transform protein engineering and production for protein design, synthetic biology, and structural analysis. Here we present a process, protein fabrication automation (PFA), which facilitates the rapid de novo construction of any desired ORF from oligonucleotides with low effort, high speed, and little human interaction. PFA comprises software for sequence design, data management, and the generation of instruction sets for liquid-handling robotics, a liquid-handling robot, a robust PCR scheme for gene assembly from synthetic oligonucleotides, and a genetic selection system to enrich correctly assembled full-length synthetic ORFs. The process is robust and scalable.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.