BackgroundAs sequence data sets used for the investigation of pathogen transmission patterns increase in size, automated tools and standardized methods for cluster analysis have become necessary. We have developed an automated Cluster Picker which identifies monophyletic clades meeting user-input criteria for bootstrap support and maximum genetic distance within large phylogenetic trees. A second tool, the Cluster Matcher, automates the process of linking genetic data to epidemiological or clinical data, and matches clusters between runs of the Cluster Picker.ResultsWe explore the effect of different bootstrap and genetic distance thresholds on clusters identified in a data set of publicly available HIV sequences, and compare these results to those of a previously published tool for cluster identification. To demonstrate their utility, we then use the Cluster Picker and Cluster Matcher together to investigate how clusters in the data set changed over time. We find that clusters containing sequences from more than one UK location at the first time point (multiple origin) were significantly more likely to grow than those representing only a single location.ConclusionsThe Cluster Picker and Cluster Matcher can rapidly process phylogenetic trees containing tens of thousands of sequences. Together these tools will facilitate comparisons of pathogen transmission dynamics between studies and countries.
Background. Many studies of sexual behavior have shown that individuals vary greatly in their number of sexual partners over time, but it has proved difficult to obtain parameter estimates relating to the dynamics of human immunodeficiency virus (HIV) transmission except in small-scale contact tracing studies. Recent developments in molecular phylodynamics have provided new routes to obtain these parameter estimates, and current clinical practice provides suitable data for entire infected populations.Methods. A phylodynamic analysis was performed on partial pol gene sequences obtained for routine clinical care from 14 560 individuals, representing approximately 60% of the HIV-positive men who have sex with men (MSM) under care in the United Kingdom.Results. Among individuals linked to others in the data set, 29% are linked to only 1 individual, 41% are linked to 2–10 individuals, and 29% are linked to ≥10 individuals. The right-skewed degree distribution can be approximated by a power law, but the data are best fitted by a Waring distribution for all time depths. For time depths of 5–7 years, the distribution parameter ρ lies within the range that indicates infinite variance.Conclusions. The transmission network among UK MSM is characterized by preferential association such that a randomly distributed intervention would not be expected to stop the epidemic.
The heterosexual risk group has become the largest HIV infected group in the United Kingdom during the last 10 years, but little is known of the network structure and dynamics of viral transmission in this group. The overwhelming majority of UK heterosexual infections are of non-B HIV subtypes, indicating viruses originating among immigrants from sub-Saharan Africa. The high rate of HIV evolution, combined with the availability of a very high density sample of viral sequences from routine clinical care has allowed the phylodynamics of the epidemic to be investigated for the first time. Sequences of the viral protease and partial reverse transcriptase coding regions from 11,071 patients infected with HIV of non-B subtypes were studied. Of these, 2774 were closely linked to at least one other sequence by nucleotide distance. Including the closest sequences from the global HIV database identified 296 individuals that were in UK-based groups of 3 or more individuals. There were a total of 8 UK-based clusters of 10 or more, comprising 143/2774 (5%) individuals, much lower than the figure of 25% obtained earlier for men who have sex with men (MSM). Sample dates were incorporated into relaxed clock phylogenetic analyses to estimate the dates of internal nodes. From the resulting time-resolved phylogenies, the internode lengths, used as estimates of maximum transmission intervals, had a median of 27 months overall, over twice as long as obtained for MSM (14 months), with only 2% of transmissions occurring in the first 6 months after infection. This phylodynamic analysis of non-B subtype HIV sequences representing over 40% of the estimated UK HIV-infected heterosexual population has revealed heterosexual HIV transmission in the UK is clustered, but on average in smaller groups and is transmitted with slower dynamics than among MSM. More effective intervention to restrict the epidemic may therefore be feasible, given effective diagnosis programmes.
Genetically diverse pathogens (such as Human Immunodeficiency virus type 1, HIV-1) are frequently stratified into phylogenetically or immunologically defined subtypes for classification purposes. Computational identification of such subtypes is helpful in surveillance, epidemiological analysis and detection of novel variants, e.g., circulating recombinant forms in HIV-1. A number of conceptually and technically different techniques have been proposed for determining the subtype of a query sequence, but there is not a universally optimal approach. We present a model-based phylogenetic method for automatically subtyping an HIV-1 (or other viral or bacterial) sequence, mapping the location of breakpoints and assigning parental sequences in recombinant strains as well as computing confidence levels for the inferred quantities. Our Subtype Classification Using Evolutionary ALgorithms (SCUEAL) procedure is shown to perform very well in a variety of simulation scenarios, runs in parallel when multiple sequences are being screened, and matches or exceeds the performance of existing approaches on typical empirical cases. We applied SCUEAL to all available polymerase (pol) sequences from two large databases, the Stanford Drug Resistance database and the UK HIV Drug Resistance Database. Comparing with subtypes which had previously been assigned revealed that a minor but substantial (≈5%) fraction of pure subtype sequences may in fact be within- or inter-subtype recombinants. A free implementation of SCUEAL is provided as a module for the HyPhy package and the Datamonkey web server. Our method is especially useful when an accurate automatic classification of an unknown strain is desired, and is positioned to complement and extend faster but less accurate methods. Given the increasingly frequent use of HIV subtype information in studies focusing on the effect of subtype on treatment, clinical outcome, pathogenicity and vaccine design, the importance of accurate, robust and extensible subtyping procedures is clear.
Patients infected with prevalent non-B subtypes were as likely to achieve viral load suppression as persons infected with subtype B and showed comparable rates of CD4 cell count recovery. HAART achieves excellent outcomes regardless of the infecting subtype.
Disease progression in HIV-infected individuals varies greatly, and while the environmental and host factors influencing this variation have been widely investigated, the viral contribution to variation in set-point viral load, a predictor of disease progression, is less clear. Previous studies, using transmission-pairs and analysis of phylogenetic signal in small numbers of individuals, have produced a wide range of viral genetic effect estimates. Here we present a novel application of a population-scale method based in quantitative genetics to estimate the viral genetic effect on set-point viral load in the UK subtype B HIV-1 epidemic, based on a very large data set. Analyzing the initial viral load and associated pol sequence, both taken before anti-retroviral therapy, of 8,483 patients, we estimate the proportion of variance in viral load explained by viral genetic effects to be 5.7% (CI 2.8–8.6%). We also estimated the change in viral load over time due to selection on the virus and environmental effects to be a decline of 0.05 log10 copies/mL/year, in contrast to recent studies which suggested a reported small increase in viral load over the last 20 years might be due to evolutionary changes in the virus. Our results suggest that in the UK epidemic, subtype B has a small but significant viral genetic effect on viral load. By allowing the analysis of large sample sizes, we expect our approach to be applicable to the estimation of the genetic contribution to traits in many organisms.
ObjectivesThe aim of this study was to characterize the prevalence and patterns of genotypic integrase inhibitor (INI) resistance in relation to HIV-1 clade.MethodsThe cohort comprised 533 INI-naive subjects and 255 raltegravir recipients with viraemia who underwent integrase sequencing in routine care across Europe, including 134/533 (25.1%) and 46/255 (18.0%), respectively, with non-B clades (A, C, D, F, G, CRF01, CRF02, other CRFs, complex).ResultsNo major INI resistance-associated mutations (RAMs) occurred in INI-naive subjects. Among raltegravir recipients with viraemia (median 3523 HIV-1 RNA copies/mL), 113/255 (44.3%) had one or more major INI RAMs, most commonly N155H (45/255, 17.6%), Q148H/R/K + G140S/A (35/255, 13.7%) and Y143R/C/H (12/255, 4.7%). In addition, four (1.6%) raltegravir recipients showed novel mutations at recognized resistance sites (E92A, S147I, N155D, N155Q) and novel mutations at other integrase positions that were statistically associated with raltegravir exposure (K159Q/R, I161L/M/T/V, E170A/G). Comparing subtype B with non-B clades, Q148H/R/K occurred in 42/209 (20.1%) versus 2/46 (4.3%) subjects (P = 0.009) and G140S/A occurred in 36/209 (17.2%) versus 1/46 (2.2%) subjects (P = 0.005). Intermediate- to high-level cross-resistance to twice-daily dolutegravir was predicted in 40/255 (15.7%) subjects, more commonly in subtype B versus non-B clades (39/209, 18.7% versus 1/46, 2.2%; P = 0.003). A glycine (G) to serine (S) substitution at integrase position 140 required one nucleotide change in subtype B and two nucleotide changes in all non-B clades.ConclusionsNo major INI resistance mutations occurred in INI-naive subjects. Reduced occurrence of Q148H/R/K + G140S/A was seen in non-B clades versus subtype B, and was explained by the higher genetic barrier to the G140S mutation observed in all non-B clades analysed.
Background. The United Kingdom human immunodeficiency virus (HIV) epidemic was historically dominated by HIV subtype B transmission among men who have sex with men (MSM). Now 50% of diagnoses and prevalent infections are among heterosexual individuals and mainly involve non-B subtypes. Between 2002 and 2010, the prevalence of non-B diagnoses among MSM increased from 5.4% to 17%, and this study focused on the drivers of this change.Methods. Growth between 2007 and 2009 in transmission clusters among 14 000 subtype A1, C, D, and G sequences from the United Kingdom HIV Drug Resistance Database was analysed by risk group.Results. Of 1148 clusters containing at least 2 sequences in 2007, >75% were pairs and >90% were heterosexual. Most clusters (71.4%) did not grow during the study period. Growth was significantly lower for small clusters and higher for clusters of ≥7 sequences, with the highest growth observed for clusters comprising sequences from MSM and people who inject drugs (PWID). Risk group (P < .0001), cluster size (P < .0001), and subtype (P < .01) were predictive of growth in a generalized linear model.Discussion. Despite the increase in non-B subtypes associated with heterosexual transmission, MSM and PWID are at risk for non-B infections. Crossover of subtype C from heterosexuals to MSM has led to the expansion of this subtype within the United Kingdom.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.