Background Mammalian X and Y chromosomes share a common evolutionary origin and retain regions of high sequence similarity. Similar sequence content can confound the mapping of short next-generation sequencing reads to a reference genome. It is therefore possible that the presence of both sex chromosomes in a reference genome can cause technical artifacts in genomic data and affect downstream analyses and applications. Understanding this problem is critical for medical genomics and population genomic inference. Results Here, we characterize how sequence homology can affect analyses on the sex chromosomes and present XYalign, a new tool that (1) facilitates the inference of sex chromosome complement from next-generation sequencing data; (2) corrects erroneous read mapping on the sex chromosomes; and (3) tabulates and visualizes important metrics for quality control such as mapping quality, sequencing depth, and allele balance. We find that sequence homology affects read mapping on the sex chromosomes and this has downstream effects on variant calling. However, we show that XYalign can correct mismapping, resulting in more accurate variant calling. We also show how metrics output by XYalign can be used to identify XX and XY individuals across diverse sequencing experiments, including low- and high-coverage whole-genome sequencing, and exome sequencing. Finally, we discuss how the flexibility of the XYalign framework can be leveraged for other uses including the identification of aneuploidy on the autosomes. XYalign is available open source under the GNU General Public License (version 3). Conclusions Sex chromsome sequence homology causes the mismapping of short reads, which in turn affects downstream analyses. XYalign provides a reproducible framework to correct mismapping and improve variant calling on the sex chromsomes.
Oligonucleotide hybridization to a complementary sequence that is covalently attached to an electrochemically active conducting polymer (ECP) coating the working electrode of an electrochemical cell causes an increase in reaction impedance for the ferro-ferricyanide redox couple. We demonstrate the use of this effect to measure, in real time, the progress of DNA polymerase chain reaction (PCR) amplification of a minor component of a DNA extract. The forward primer is attached to the ECP. The solution contains other PCR components and the redox couple. Each cycle of amplification gives an easily measurable impedance increase. Target concentration can be estimated by cycle count to reach a threshold impedance. As proof of principle, we demonstrate an electrochemical real-time quantitative PCR (e-PCR) measurement in the total DNA extracted from chicken blood of an 844 base pair region of the mitochondrial Cytochrome c oxidase gene, present at ∼1 ppm of total DNA. We show that the detection and semiquantitation of as few as 2 copies/μL of target can be achieved within less than 10 PCR cycles.
Rapid, cost-effective identification of genetic variants in small candidate genomic regions remains a challenge, particularly for less well equipped or lower throughput laboratories. The application of Oxford Nanopore Technologies’ MinION sequencer has the potential to fulfil this requirement. We demonstrate a proof of concept for a multiplexing assay that pools PCR amplicons for MinION sequencing to enable sequencing of multiple templates from multiple individuals, which could be applied to gene-targeted diagnostics. A combined strategy of barcoding and sample pooling was developed for simultaneous multiplex MinION sequencing of 100 PCR amplicons. The amplicons are family-specific, spanning a total of 30 loci in DNA isolated from 82 human neurodevelopmental cases and family members. The target regions were chosen for further interrogation because a potentially disease-causative variant had been identified in affected individuals following Illumina exome sequencing. The pooled MinION sequences were deconvoluted by aligning to custom references using the minimap2 aligner software. Our multiplexing approach produced an interpretable and expected sequence from 29 of the 30 targeted genetic loci. The sequence variant which was not correctly resolved in the MinION sequence was adjacent to a five nucleotide homopolymer. It is already known that homopolymers present a resolution problem with the MinION approach. Interestingly despite equimolar quantities of PCR amplicon pooled for sequencing, significant variation in the depth of coverage (127×–19,626×; mean = 8321×, std err = 452.99) was observed. We observed independent relationships between depth of coverage and target length, and depth of coverage and GC content. These relationships demonstrate biases of the MinION sequencer for longer templates and those with lower GC content. We demonstrate an efficient approach for variant discovery or confirmation from short DNA templates using the MinION sequencing device. With less than 130 × depth of coverage required for accurate genotyping, the methodology described here allows for rapid highly multiplexed targeted sequencing of large numbers of samples in a minimally equipped laboratory with a potential cost as much 200 × less than that from Sanger sequencing.
Autosomal recessive ataxias are characterised by a fundamental loss in coordination of gait with associated atrophy of the cerebellum. There is significant clinical and genetic heterogeneity amongst inherited ataxias; however, an early molecular diagnosis is essential with low-risk treatments available for some of these conditions. We describe two female siblings who presented early in life with unsteady gait and cerebellar atrophy. Whole exome sequencing revealed compound heterozygous inheritance of two pathogenic mutations (p.Leu277Pro, c.1506+1G>A) in the coenzyme Q8A gene (COQ8A), a gene central to biosynthesis of coenzyme Q (CoQ). The paternally derived p.Leu277Pro mutation is predicted to disrupt a conserved motif in the substrate-binding pocket of the protein, resulting in inhibition of CoQ production. The maternal c.1506+1G>A mutation destroys a canonical splice donor site in exon 12 affecting transcript processing and subsequent protein translation. Mutations in this gene can result in primary coenzyme Q deficiency type 4, which is characterized by childhood onset of cerebellar ataxia and exercise intolerance, both of which were observed in this sib-pair. Muscle biopsies revealed unequivocally low levels of CoQ and the siblings were subsequently established on a therapeutic dose of CoQ with distinct clinical evidence of improvement after 1 year of treatment. This case emphasises the importance of an early and accurate molecular diagnosis for suspected inherited ataxias, particularly given the availability of approved treatments for some subtypes.
Mutations in the gene SLC19A3 result in thiamine metabolism dysfunction syndrome 2, also known as biotin-thiamine-responsive basal ganglia disease (BTBGD). This neurometabolic disease typically presents in early childhood with progressive neurodegeneration, including confusion, seizures, and dysphagia, advancing to coma and death. Treatment is possible via supplement of biotin and/or thiamine, with early treatment resulting in significant lifelong improvements. Here we report two siblings who received a refined diagnosis of BTBGD following whole-genome sequencing. Both children inherited compound heterozygous mutations from unaffected parents; a missense single-nucleotide variant (p.G23V) in the first transmembrane domain of the protein, and a 4808-bp deletion in exon 1 encompassing the 5′ UTR and minimal promoter region. This deletion is the smallest promoter deletion reported to date, further defining the minimal promoter region of SLC19A3. Unfortunately, one of the siblings died prior to diagnosis, but the other is showing significant improvement after commencement of therapy. This case demonstrates the power of whole-genome sequencing for the identification of structural variants and subsequent diagnosis of rare neurodevelopmental disorders.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.