2022
DOI: 10.3389/fepid.2022.943625
|View full text |Cite
|
Sign up to set email alerts
|

A maximum-likelihood method to estimate haplotype frequencies and prevalence alongside multiplicity of infection from SNP data

Abstract: The introduction of genomic methods facilitated standardized molecular disease surveillance. For instance, SNP barcodes in Plasmodium vivax and Plasmodium falciparum malaria allows the characterization of haplotypes, their frequencies and prevalence to reveal temporal and spatial transmission patterns. A confounding factor is the presence of multiple genetically distinct pathogen variants within the same infection, known as multiplicity of infection (MOI). Disregarding ambiguous information, as usually done in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(16 citation statements)
references
References 55 publications
(93 reference statements)
1
15
0
Order By: Relevance
“…For the Poisson and conditional Poisson distribution, this approach was generalized to an arbitrary number of SNPs in cf. Li et al ( 30 ) and Tsoungui Obama and Schneider ( 31 ). In Schneider and Escalante ( 12 ) profile-likelihood confidence intervals for the MLE assuming a single locus and a conditional Poisson distribution are constructed.…”
Section: Resultsmentioning
confidence: 99%
“…For the Poisson and conditional Poisson distribution, this approach was generalized to an arbitrary number of SNPs in cf. Li et al ( 30 ) and Tsoungui Obama and Schneider ( 31 ). In Schneider and Escalante ( 12 ) profile-likelihood confidence intervals for the MLE assuming a single locus and a conditional Poisson distribution are constructed.…”
Section: Resultsmentioning
confidence: 99%
“…For the haplotype based test, we assume that the haplotype frequencies are estimated from the noisy allele frequencies by solving the regression model [ 35 ]. Other methods of haplotype frequencies may also be used, see for instance [ 36 ] that proposes an EM algorithm, or [ 37 ] for a maximum likelihood based approach. The variances of the estimated regression coefficients as well as of their sums are then used as additional components of variance.…”
Section: Resultsmentioning
confidence: 99%
“…Reference-free methods, on the other hand, are not scalable. They either only handle 2–4 strains per infection ( Zhu et al 2018 , Gabbassov et al 2021 ) or are limited to only a few SNP sites ( Tsoungui Obama and Schneider 2022 ). Furthermore, these algorithms focus on resolving mixed infections within individual hosts and do not effectively leverage population-level SNP-haplotypes and allele frequencies (AFs).…”
Section: Introductionmentioning
confidence: 99%