The genetic content of wild-type human cytomegalovirus was investigated by sequencing the 235 645 bp genome of a low passage strain (Merlin). Substantial regions of the genome (genes RL1-UL11, UL105-UL112 and UL120-UL150) were also sequenced in several other strains, including two that had not been passaged in cell culture. Comparative analyses, which employed the published genome sequence of a high passage strain (AD169), indicated that Merlin accurately reflects the wild-type complement of 165 genes, containing no obvious mutations other than a single nucleotide substitution that truncates gene UL128. A sizeable subset of genes exhibits unusually high variation between strains, and comprises many, but not all, of those that encode proteins known or predicted to be secreted or membrane-associated. In contrast to unpassaged strains, all of the passaged strains analysed have visibly disabling mutations in one or both of two groups of genes that may influence cell tropism. One comprises UL128, UL130 and UL131A, which putatively encode secreted proteins, and the other contains RL5A, RL13 and UL9, which are members of the RL11 glycoprotein gene family. The case in support of a lack of protein-coding potential in the region between UL105 and UL111A was also strengthened.
Human cytomegalovirus (HCMV) in clinical material cannot replicate efficiently in vitro until it has adapted by mutation. Consequently, wild-type HCMV differ fundamentally from the passaged strains used for research. To generate a genetically intact source of HCMV, we cloned strain Merlin into a self-excising BAC. The Merlin BAC clone had mutations in the RL13 gene and UL128 locus that were acquired during limited replication in vitro prior to cloning. The complete wild-type HCMV gene complement was reconstructed by reference to the original clinical sample. Characterization of viruses generated from repaired BACs revealed that RL13 efficiently repressed HCMV replication in multiple cell types; moreover, RL13 mutants rapidly and reproducibly emerged in transfectants. Virus also acquired mutations in genes UL128, UL130, or UL131A, which inhibited virus growth specifically in fibroblast cells in wild-type form. We further report that RL13 encodes a highly glycosylated virion envelope protein and thus has the potential to modulate tropism. To overcome rapid emergence of mutations in genetically intact HCMV, we developed a system in which RL13 and UL131A were conditionally repressed during virus propagation. This technological advance now permits studies to be undertaken with a clonal, characterized HCMV strain containing the complete wild-type gene complement and promises to enhance the clinical relevance of fundamental research on HCMV.
Deep sequencing was used to bring high resolution to the human cytomegalovirus (HCMV) transcriptome at the stage when infectious virion production is under way, and major findings were confirmed by extensive experimentation using conventional techniques. The majority (65.1%) of polyadenylated viral RNA transcription is committed to producing four noncoding transcripts (RNA2.7, RNA1.2, RNA4.9, and RNA5.0) that do not substantially overlap designated protein-coding regions. Additional noncoding RNAs that are transcribed antisense to protein-coding regions map throughout the genome and account for 8.7% of transcription from these regions. RNA splicing is more common than recognized previously, which was evidenced by the identification of 229 potential donor and 132 acceptor sites, and it affects 58 proteincoding genes. The great majority (94) of 96 splice junctions most abundantly represented in the deep-sequencing data was confirmed by RT-PCR or RACE or supported by involvement in alternative splicing. Alternative splicing is frequent and particularly evident in four genes (RL8A, UL74A, UL124, and UL150A) that are transcribed by splicing from any one of many upstream exons. The analysis also resulted in the annotation of four previously unrecognized protein-coding regions (RL8A, RL9A, UL150A, and US33A), and expression of the UL150A protein was shown in the context of HCMV infection. The overall conclusion, that HCMV transcription is complex and multifaceted, has implications for the potential sophistication of virus functionality during infection. The study also illustrates the key contribution that deep sequencing can make to the genomics of nuclear DNA viruses.T he genetic repertoire of human cytomegalovirus (HCMV; species Human herpesvirus 5) is incompletely understood. Most bioinformatic investigations have focused on identifying open reading frames (ORFs) that are conserved in other organisms or that exhibit pattern-based similarities (e.g., in nucleotide or codon bias) to recognized protein-coding regions (CRs) (1). Our current map of the wild-type HCMV genome, based on strain Merlin, contains 166 protein-coding genes (2-5). It is entirely possible that additional, small protein-coding genes will be found. Candidates involve ORFs that overlap recognized CRs and for which there is some evidence for expression (6), ORFs highlighted in pattern-based bioinformatic (7) or proteomic analyses (8), and ORFs whose expression is presently unsuspected.Recognition of many protein-coding genes has been supplemented by information on protein expression and function. However, HCMV also specifies polyadenylated (polyA) transcripts that, because they lack sizeable, conserved ORFs, appear unlikely to function via translation. One class consists of noncoding, nonoverlapping transcripts (NNTs) that do not substantially overlap the designated CRs of other genes. These include an abundant 2.7-kb RNA (β2.7 or RNA2.7) (9), a 1.1-kb spliced RNA and associated 5-kb stable intron (RNA5.0) (10, 11), and a 1.2-kb RNA (RNA1.2) (12). RNA...
Mutations that occurred during adaptation of human cytomegalovirus to cell culture were monitored by isolating four strains from clinical samples, passaging them in various cell types and sequencing ten complete virus genomes from the final passages. Mutational dynamics were assessed by targeted sequencing of intermediate passages and the original clinical samples. Gene RL13 and the UL128 locus (UL128L, consisting of genes UL128, UL130 and UL131A) mutated in all strains. Mutations in RL13 occurred in fibroblast, epithelial and endothelial cells, whereas those in UL128L were limited to fibroblasts and detected later than those in RL13. In addition, a region containing genes UL145, UL144, UL142, UL141 and UL140 mutated in three strains. All strains exhibited numerous mutations in other regions of the genome, with a preponderance in parts of the inverted repeats. An investigation was carried out on the kinetic growth yields of viruses derived from selected passages that were predominantly non-mutated in RL13 and UL128L (RL13+UL128L+), or that were largely mutated in RL13 (RL13−UL128L+) or both RL13 and UL128L (RL13−UL128L−). RL13−UL128L− viruses produced greater yields of infectious progeny than RL13−UL128L+ viruses, and RL13−UL128L+ viruses produced greater yields than RL13+UL128L+ viruses. These results suggest strongly that RL13 and UL128L exert at least partially independent suppressive effects on growth in fibroblasts. As all isolates proved genetically unstable in all cell types tested, caution is advised in choosing and monitoring strains for experimental studies of vulnerable functions, particularly those involved in cell tropism, immune evasion or growth temperance.
The gene complement of wild-type human cytomegalovirus (HCMV) is incompletely understood, on account of the size and complexity of the viral genome and because laboratory strains have undergone deletions and rearrangements during adaptation to growth in culture. We have determined the sequence (241 087 bp) of chimpanzee cytomegalovirus (CCMV) and have compared it with published HCMV sequences from the laboratory strains AD169 and Toledo, with the aim of clarifying the gene content of wild-type HCMV. The HCMV and CCMV genomes are moderately diverged and essentially collinear. On the basis of conservation of potential proteincoding regions and other sequence features, we have discounted 51 previously proposed HCMV ORFs, modified the interpretations for 24 (including assignments of multiple exons) and proposed ten novel genes. Several errors were detected in the published HCMV sequences. We presently recognize 165 genes in CCMV and 145 in AD169; this compares with an estimate of 189 unique genes for AD169 made in 1990. Our best estimate for the complement of wild-type HCMV is 164 to 167 genes. INTRODUCTIONHuman cytomegalovirus (HCMV; human herpesvirus 5) is ubiquitous and largely inapparent, but poses a risk of serious disease to those lacking a competent immune system, such as neonates, transplant patients and sufferers from AIDS (reviewed in Pass, 2001). HCMV is the prototype of subfamily Betaherpesvirinae, and is the most complex of the eight human herpesvirus species. HCMV is isolated routinely on human fibroblast cell lines, and several strains in common laboratory use, such as AD169 and Towne, were derived by multiple passages on such cells (reviewed in Mocarski & Tan Courcelle, 2001).The linear, double-stranded DNA genome of AD169 comprises two covalently linked segments (L and S), each consisting of a unique region (U L and U S ) flanked by an inverted repeat (TR L and IR L , TR S and IR S ), yielding the overall genome configuration TR L -U L -IR L -IR S -U S -TR S (reviewed in Mocarski & Tan Courcelle, 2001). In addition, the genome is terminally redundant, possessing a short region (the a sequence) as a direct repeat at the termini and also in inverse orientation at the IR L -IR S junction. Some genomes contain tandemly reiterated copies of the a sequence at these locations. U L and U S can invert relative to each other by recombination between inverted repeats in replicating DNA, resulting in four equimolar genome arrangements in virion DNA. The complete DNA sequence of AD169 was published in a seminal paper by Chee et al. (1990), and at that time was the largest viral genome sequence available. The total genome size was 229 354 bp, with U L being 166 972 bp, U S 35 418 bp, R L (a collective term for TR L and IR L ) 11 247 bp, R S (TR S and IR S ) 2524 bp and the a sequence (part of R L and R S in the sizes given above) 578 bp.As a primary criterion for identifying protein-coding regions, Chee et al. (1990) focused on open reading frames (ORFs) of 100 or more contiguous amino acidencoding codons that ov...
We have assessed two approaches to sequencing complete human cytomegalovirus (HCMV) genomes (236 kbp) in DNA extracted from infected cell cultures (strains 3157, HAN13, HAN20 and HAN38) or clinical specimens (strains JP and 3301). The first approach involved amplifying genomes from the DNA samples as overlapping PCR products, sequencing these by the Sanger method, acquiring reads from a capillary instrument and assembling these using the Staden programs. The second approach involved generating sequence data from the DNA samples by using an Illumina Genome Analyzer (IGA), processing the filtered reads by reference-independent (de novo) assembly, utilizing the resulting sequence to direct reference-dependent assembly of the same data and finishing by limited PCR sequencing. Both approaches were successful. In particular, the investigation demonstrated the utility of IGA data for efficiently sequencing genomes from clinical samples containing as little as 3 % HCMV DNA. Analysis of the genome sequences obtained showed that each of the strains grown in cell culture was a mutant. Certain of the mutations were shared among strains from independent clinical sources, thus suggesting that they may have arisen in a common ancestor during natural infection. Moreover, one of the strains (JP) sequenced directly from a clinical specimen was mutated in two genes, one of which encodes a proposed immune-evasion function, viral interleukin-10. These observations imply that HCMV mutants exist in human infections.
Two novel spliced genes (UL131A and UL128) flanking UL130 were predicted from sequence comparisons between human cytomegalovirus (HCMV) and its closest known relative, chimpanzee cytomegalovirus (CCMV), and the splicing patterns were confirmed by mRNA mapping experiments. Both genes were transcribed with late kinetics and shared a polyadenylation site. Comparisons with wild-type HCMV in infected human tissues showed that three of five isolates passaged in cell culture contained disruptions of UL128, one was frameshifted in UL131A and one exhibited a deletion affecting UL131A and UL130. CCMV and the Colburn strain of simian cytomegalovirus, which have been passaged in cell culture, also exhibit disruptions of UL128. These observations indicate that expression of either one of UL128 and UL131A is deleterious to growth of primate cytomegaloviruses in cell culture. Although the functions of these genes are unknown, sequence comparisons suggest that UL128 encodes a b-chemokine.The genome of the AD169 strain of human cytomegalovirus (HCMV; human herpesvirus 5) was characterized by Chee et al. (1990) as containing 189 putative protein-coding open reading frames (ORFs), some duplicated in an inverted repeat. An additional genome region was subsequently discovered in the Toledo strain (Cha et al., 1996). A recent comparison of these sequences with that of chimpanzee cytomegalovirus (CCMV) indicated that wild-type HCMV has 166-169 genes (Davison et al., 2003a, b). The present work concerns two of the eleven newly predicted genes in this redefined set.The upper part of Fig. 1(A) depicts the arrangement of ORFs UL131-UL128 as predicted by Chee et al. (1990), and the lower parts show alternative predictions based on comparisons between the AD169 and CCMV sequences. UL130 is unaltered, while spliced genes replace UL131 upstream and UL129 plus UL128 downstream. One of these genes is named UL131A because it occupies the same region as UL131 but does not share any encoded amino acid sequence, since the first exon is in a different reading frame from UL131. The other spliced gene retains the designation UL128 because it shares amino acid sequence with the original UL128 but not with UL129. Fig. 1(B, C) shows detailed alignments of the AD169 and CCMV sequences in these regions. Protein-coding regions were proposed from conservation of encoded amino acid sequences, and conceptually linked together via candidate splice donor and acceptor sites. This led to the hypothesis that UL131A and UL128 comprise two and three exons, respectively.In order to sustain this interpretation, it is necessary to propose that AD169 has a frameshift mutation (an additional residue making a tract of eight A residues) in UL131A exon 1 (Fig. 1B) and that CCMV has a frameshift mutation (an additional residue making a tract of eight C residues) in UL128 exon 1 (Fig. 1C). Davison et al. (2003a) confirmed the former lesion by resequencing the relevant AD169 region and comparing it with sequence obtained directly from clinical material. The proposed mutation in...
The human cytomegalovirus (HCMV) virion envelope contains a complex consisting of glycoproteins gH and gL plus proteins encoded by the UL128 locus (UL128L): pUL128, pUL130, and pUL131A. UL128L is necessary for efficient infection of myeloid, epithelial, and endothelial cells but limits replication in fibroblasts. Consequently, disrupting mutations in UL128L are rapidly selected when clinical isolates are cultured in fibroblasts. In contrast, bacterial artificial chromosome (BAC)-cloned strains TB40-BAC4, FIX, and TR do not contain overt disruptions in UL128L, yet no virus reconstituted from them has been reported to acquire mutations in UL128L in vitro. We performed BAC mutagenesis and reconstitution experiments to test the hypothesis that these strains contain subtle mutations in UL128L that were acquired during passage prior to BAC cloning. Compared to strain Merlin containing wild-type UL128L, all three strains produced higher yields of cell-free virus. Moreover, TB40-BAC4 and FIX spread cell to cell more rapidly than wild-type Merlin in fibroblasts but more slowly in epithelial cells. The differential growth properties of TB40-BAC4 and FIX (but not TR) were mapped to single-nucleotide substitutions in UL128L. The substitution in TB40-BAC4 reduced the splicing efficiency of UL128, and that in FIX resulted in an amino acid substitution in UL130. Introduction of these substitutions into Merlin dramatically increased yields of cell-free virus and increased cell-to-cell spread in fibroblasts but reduced the abundance of pUL128 in the virion and the efficiency of epithelial cell infection. These substitutions appear to represent mutations in UL128L that permit virus to be propagated in fibroblasts while retaining epithelial cell tropism.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.