Accurate identification of the transmitted virus and sequences evolving from it could be instrumental in elucidating the transmission of human immunodeficiency virus type 1 (HIV-1) and in developing vaccines, drugs, or microbicides to prevent infection. Here we describe an experimental approach to analyze HIV-1 env genes as intact genetic units amplified from plasma virion RNA by single-genome amplification (SGA), followed by direct sequencing of uncloned DNA amplicons. We show that this strategy precludes in vitro artifacts caused by Taq-induced nucleotide substitutions and template switching, provides an accurate representation of the env quasispecies in vivo, and has an overall error rate (including nucleotide misincorporation, insertion, and deletion) of less than 8 ؋ 10 ؊5 . Applying this method to the analysis of virus in plasma from 12 Zambian subjects from whom samples were obtained within 3 months of seroconversion, we show that transmitted or early founder viruses can be identified and that molecular pathways and rates of early env diversification can be defined. Specifically, we show that 8 of the 12 subjects were each infected by a single virus, while 4 others acquired more than one virus; that the rate of virus evolution in one subject during an 80-day period spanning seroconversion was 1.7 ؋ 10 ؊5 substitutions per site per day; and that evidence of strong immunologic selection can be seen in Env and overlapping Rev sequences based on nonrandom accumulation of nonsynonymous mutations. We also compared the results of the SGA approach with those of moreconventional bulk PCR amplification methods performed on the same patient samples and found that the latter is associated with excessive rates of Taq-induced recombination, nucleotide misincorporation, template resampling, and cloning bias. These findings indicate that HIV-1 env genes, other viral genes, and even full-length viral genomes responsible for productive clinical infection can be identified by SGA analysis of plasma virus sampled at intervals typical in large-scale vaccine trials and that pathways of viral diversification and immune escape can be determined accurately.
We describe a mathematical model and Monte-Carlo (MC) simulation of viral evolution during acute infection. We consider both synchronous and asynchronous processes of viral infection of new target cells. The model enables an assessment of the expected sequence diversity in new HIV-1 infections originating from a single transmitted viral strain, estimation of the most recent common ancestor (MRCA) of the transmitted viral lineage, and estimation of the time to coalesce back to the MRCA. We also calculate the probability of the MRCA being the transmitted virus or an evolved variant. Excluding insertions and deletions, we assume HIV-1 evolves by base substitution without selection pressure during the earliest phase of HIV-1 infection prior to the immune response. Unlike phylogenetic methods that follow a lineage backwards to coalescence, we compare the observed data to a model of the diversification of a viral population forward in time. To illustrate the application of these methods, we provide detailed comparisons of the model and simulations results to 306 envelope sequences obtained from 8 newly infected subjects at a single time point. The data from 6/8 patients were in good agreement with model predictions, and hence compatible with a single-strain infection evolving under no selection pressure. The diversity of the samples from the other two patients was too great to be explained by the model, suggesting multiple HIV-1-strains were transmitted. The model can also be applied to longitudinal patient data to estimate within-host viral evolutionary parameters.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.