Accurate and complete genome sequences are essential in biotechnology to facilitate genome-based cell engineering efforts. The current genome assemblies for Cricetulus griseus, the Chinese hamster, are fragmented and replete with gap sequences and misassemblies, consistent with most short-read based assemblies. Here, we completely resequenced C. griseus using Single Molecule Real Time (SMRT) sequencing and merged this with Illumina-based assemblies. This generated a more contiguous and complete genome assembly than either technology alone, reducing the number of scaffolds by >28-fold, with 90% of the sequence in the 122 longest scaffolds. Most genes are now found in single scaffolds, including up- and downstream regulatory elements, enabling improved study of noncoding regions. With >95% of the gap sequence filled, important CHO cell mutations have been detected in draft assembly gaps. This new assembly will be an invaluable resource for continued basic and pharmaceutical research.
Herpes simplex virus (HSV) is a new platform for gene therapy. We cloned the human herpesvirus HSV-1 strain F genome into a bacterial artificial chromosome (BAC) and adapted chromosomal gene replacement technology to manipulate the viral genome. This technology exploits the power of bacterial genetics and permits generation of recombinant viruses in as few as 7 days. We utilized this technology to delete the viral packaging/cleavage (pac) sites from HSV-BAC. HSV-BAC DNA is stable in bacteria and the pac-deleted HSV-BAC (p45-25) is able to package amplicon plasmid DNA as efficiently as a comparable pac-deleted HSV cosmid set when transfected into mammalian cells. Moreover, the utility of bacterial gene replacement is not limited to HSV, since most herpesviruses can be cloned as BACs. Thus, this technology will greatly facilitate genetic manipulation of all herpesviruses for their use as research tools or as vectors in gene therapy.
The Chinese hamster genome serves as a reference genome for the study of Chinese hamster ovary (CHO) cells, the preferred host system for biopharmaceutical production.Recent re-sequencing of the Chinese hamster genome resulted in the RefSeq PICR metaassembly, a set of highly accurate scaffolds that filled over 95% of the gaps in previous assembly versions. However, these scaffolds did not reach chromosome-scale due to the absence of long-range scaffolding information during the meta-assembly process. Here,
Chinese hamster ovary (CHO) cells are a major host cell line for the production of therapeutic proteins, and CHO cell and Chinese hamster (CH) genomes have recently been sequenced using next-generation sequencing methods. CHOgenome.org was launched in 2011 (version 1.0) to serve as a database repository and to provide bioinformatics tools for the CHO community. CHOgenome.org (version 1.0) maintained GenBank CHO-K1 genome data, identified CHO-omics literature, and provided a CHO-specific BLAST service. Recent major updates to CHOgenome.org (version 2.0) include new sequence and annotation databases for both CHO and CH genomes, a more user-friendly website, and new research tools, including a proteome browser and a genome viewer. CHO cell-line specific sequences and annotations facilitate cell line development opportunities, several of which are discussed. Moving forward, CHOgenome.org will host the increasing amount of CHO-omics data and continue to make useful bioinformatics tools available to the CHO community.
Protein structure is commonly regarded to be conserved and to dictate function. Most proteins rely on conformational flexibility to some degree. Are regions that convey conformational flexibility conserved over evolutionary time? Can changes in conformational flexibility alter protein function? Here, the evolutionary dynamics of structurally ordered and disordered (flexible) regions are investigated genome-wide in flaviviruses, revealing that the amount and location of structural disorder fluctuates highly among related proteins. Some regions are prone to shift between structured and flexible states. Increased evolutionary dynamics of structural disorder is observed for some lineages but not in others. Lineage-specific transitions of this kind could alter the conformational ensemble accessible to the same protein in different species, causing a functional change, even if the predominant function remains conserved. Thus, rapid evolutionary dynamics of structural disorder is a potential driving force for phenotypic divergence among flaviviruses.
BackgroundThe accumulation of protein structural data occurs more rapidly than it can be characterized by traditional laboratory means. This has motivated widespread efforts to predict enzyme function computationally. The most useful/accurate strategies employed to date are based on the detection of motifs in novel structures that correspond to a specific function. Functional residues are critical components of predictively useful motifs. We have implemented a novel method, to complement current approaches, which detects motifs solely on the basis of distance restraints between catalytic residues.ResultsProMOL is a plugin for the PyMOL molecular graphics environment that can be used to create active site motifs for enzymes. A library of 181 active site motifs has been created with ProMOL, based on definitions published in the Catalytic Site Atlas (CSA). Searches with ProMOL produce better than 50% useful Enzyme Commission (EC) class suggestions for level 1 searches in EC classes 1, 4 and 5, and produce some useful results for other classes. 261 additional motifs automatically translated from Jonathan Barker’s JESS motif set [Bioinformatics 19:1644–1649, 2003] and a set of NMR motifs is under development. Alignments are evaluated by visual superposition, Levenshtein distance and root-mean-square deviation (RMSD) and are reasonably consistent with related search methods.ConclusionThe ProMOL plugin for PyMOL provides ready access to template-based local alignments. Recent improvements to ProMOL, including the expanded motif library, RMSD calculations and output selection formatting, have greatly increased the program’s usability and speed, and have improved the way that the results are presented.
This work was supported by: R01HD060769 from the Eunice Kennedy Shriver National Institute for Child Health and Human Development (NICHD), 2P20GM103446 and P20GM103464 from the National Institute of General Medical Sciences (NIGMS), and Nemours Biomedical Research. The authors have no competing interests to declare.
Adventitious agent detection during the production of vaccines and biotechnology-based medicines is of critical importance to ensure the final product is free from any possible viral contamination. Increasing the speed and accuracy of viral detection is beneficial as a means to accelerate development timelines and to ensure patient safety. Here, several rapid viral metagenomics approaches were tested on simulated next-generation sequencing (NGS) data sets and existing data sets from virus spike-in studies done in CHO-K1 and HeLa cell lines. It was observed that these rapid methods had comparable sensitivity to full-read alignment methods used for NGS viral detection for these data sets, but their specificity could be improved. A method that first filters host reads using KrakenUniq and then selects the virus classification tool based on the number of remaining reads is suggested as the preferred approach among those tested to detect nonlatent and nonendogenous viruses. Such an approach shows reasonable sensitivity and specificity for the data sets examined and requires less time and memory as full-read alignment methods. IMPORTANCE Next-generation sequencing (NGS) has been proposed as a complementary method to detect adventitious viruses in the production of biotherapeutics and vaccines to current in vivo and in vitro methods. Before NGS can be established in industry as a main viral detection technology, further investigation into the various aspects of bioinformatics analyses required to identify and classify viral NGS reads is needed. In this study, the ability of rapid metagenomics tools to detect viruses in biopharmaceutical relevant samples is tested and compared to recommend an efficient approach. The results showed that KrakenUniq can quickly and accurately filter host sequences and classify viral reads and had comparable sensitivity and specificity to slower full read alignment approaches, such as BLASTn, for the data sets examined.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.