2022
DOI: 10.1093/nargab/lqac034
|View full text |Cite
|
Sign up to set email alerts
|

A bioinformatics pipeline for estimating mitochondrial DNA copy number and heteroplasmy levels from whole genome sequencing data

Abstract: Mitochondrial diseases are a heterogeneous group of disorders that can be caused by mutations in the nuclear or mitochondrial genome. Mitochondrial DNA (mtDNA) variants may exist in a state of heteroplasmy, where a percentage of DNA molecules harbor a variant, or homoplasmy, where all DNA molecules have the same variant. The relative quantity of mtDNA in a cell, or copy number (mtDNA-CN), is associated with mitochondrial function, human disease, and mortality. To facilitate accurate identification of heteropla… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 21 publications
(33 citation statements)
references
References 47 publications
0
23
0
Order By: Relevance
“…Moreover, homopolymeric Cstretch regions located at nucleotide positions 16 184-16 193 (amplicon 2) in HVI and at positions 303-315 in HVII (amplicon 8) are highly prone to present heteroplasmy produced by the insertion/deletion of cytosines [45]. The presence of homopolymeric C-stretch generates low quality reads, reducing the number of reads that pass quality filters, and the alignment of these regions is usually poor [46,47]. Accordingly, the low performance of amplicons 2 and 8 can also be related to homopolymeric C-stretch.…”
Section: Discussionmentioning
confidence: 99%
“…Moreover, homopolymeric Cstretch regions located at nucleotide positions 16 184-16 193 (amplicon 2) in HVI and at positions 303-315 in HVII (amplicon 8) are highly prone to present heteroplasmy produced by the insertion/deletion of cytosines [45]. The presence of homopolymeric C-stretch generates low quality reads, reducing the number of reads that pass quality filters, and the alignment of these regions is usually poor [46,47]. Accordingly, the low performance of amplicons 2 and 8 can also be related to homopolymeric C-stretch.…”
Section: Discussionmentioning
confidence: 99%
“…Mitochondrial heteroplasmic mutations are composed of a combination of inherited and somatic mutations 7 . We would therefore expect that heteroplasmic mutations would increase with age, as has been previously shown 9,17 , as well as with exposure to mutagenic chemicals such as tobacco. Indeed, we observed a significant increase in heteroplasmic SNVs both as a function of age and smoking status (Fig.…”
Section: Association With All-cause Mortalitymentioning
confidence: 92%
“…We further excluded heteroplasmic variants at poly-C homopolymer regions on the mitochondrial chromosome and excluded INDELs. INDELs are often found at homopolymer regions and due to the nature of these regions, are challenging to accurately call heteroplasmies 9 . Nuclear-encoded mitochondrial sequences (NUMTs) can contribute to false positive mtDNA heteroplasmy calls.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Mitochondrial heteroplasmy was identified from whole genome sequencing (WGS) data from the UK Biobank, a large population study of people from the United Kingdom aged 40-69 years 72 , as described previously 49 . In brief, MitoHPC 73 was used to call heteroplasmic SNVs with a heteroplasmy level of >5%. Variants were filtered using the following criteria: at poly-C homopolymer regions, read depth <300, and or with base quality, strandedness, slippage, weak evidence, germline, or position flags.…”
Section: Methodsmentioning
confidence: 99%