To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most of the existing short-read aligners can be configured to favor speed in trade of accuracy and sensitivity. SOAP3-dp, through leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, Bowtie2, SeqAlto, CUSHAW2, GEM and GPU-based aligners BarraCUDA and CUSHAW, SOAP3-dp was found to be two to tens of times faster, while maintaining the highest sensitivity and lowest false discovery rate (FDR) on Illumina reads with different lengths. Transcending its predecessor SOAP3, which does not allow gapped alignment, SOAP3-dp by default tolerates alignment similarity as low as 60%. Real data evaluation using human genome demonstrates SOAP3-dp's power to enable more authentic variants and longer Indels to be discovered. Fosmid sequencing shows a 9.1% FDR on newly discovered deletions. SOAP3-dp natively supports BAM file format and provides the same scoring scheme as BWA, which enables it to be integrated into existing analysis pipelines. SOAP3-dp has been deployed on Amazon-EC2, NIH-Biowulf and Tianhe-1A.
Hypertrophic cardiomyopathy (HCM) is a genetically heterogeneous cardiac muscle disorder with a diverse natural history, characterized by unexplained left ventricular hypertrophy (LVH), with histopathological hallmarks including myocyte enlargement, myocyte disarray and myocardial fibrosis. Although these features can cause significant cardiac symptoms, many young individuals with HCM are asymptomatic or mildly symptomatic. Sudden cardiac death (SCD) may occur as the initial clinical manifestation. Over the past few decades, HCM has been considered a disease of sarcomere, and typically as an autosomal dominant disease with variable expressivity and incomplete penetrance. Important insights into the genetic landscape of HCM have enhanced our understanding of the molecular pathogenesis, empowered gene-based diagnostic testing to identify at-risk individuals, and offered potential targets for the development of therapeutic agents. This article reviews the current knowledge on the clinical genetics and management of HCM.
The bovine genetic resources in China are diverse, but their value and potential are yet to be discovered. To determine the genetic diversity and population structure of Chinese cattle, we analysed the whole genomes of 46 cattle from six phenotypically and geographically representative Chinese cattle breeds, together with 18 Red Angus cattle (RAN) genomes, 11 Japanese black cattle (JBC) genomes and taurine and indicine genomes available from previous studies. Our results showed that Chinese cattle originated from hybridization between Bos taurus and Bos indicus. Moreover, we found that the level of genetic variation in Chinese cattle depends upon the degree of indicine content. We also discovered many potential selective sweep regions associated with domestication related to breed-specific characteristics, with selective sweep regions including genes associated with coat colour (ERCC2, MC1R, ZBTB17 and MAP2K1), dairy traits (NCAPG, MAPK7, FST, ITFG1, SETMAR, PAG1, CSN3 and RPL37A), and meat production/quality traits (such as BBS2, R3HDM1, IGFBP2, IGFBP5, MYH9, MYH4 and MC5R). These findings substantially expand the catalogue of genetic variants in cattle and reveal new insights into the evolutionary history and domestication traits of Chinese cattle.
Copy number variations (CNVs) have increasingly been reported to cause, or predispose to, human disease. However, a large fraction of these CNVs have not been accurately characterized at the single-base-pair level, thereby hampering a better understanding of the mutational mechanisms underlying CNV formation. Here, employing a composite pipeline method derived from various inference-based programs, we have characterized 26 deletion CNVs [including three novel pathogenic CNVs involving an autosomal gene (EXT2) causing hereditary osteochondromas and an X-linked gene (CLCN5) causing Dent disease, as well as 23 CNVs previously identified by inference from a cohort of Canadian autism spectrum disorder families] to the single-base-pair level of accuracy from whole-genome sequencing data. We found that breakpoint-flanking micro-mutations (within 22 bp of the breakpoint) are present in a significant fraction (5/26; 19%) of the deletion CNVs. This analysis also provided evidence that a recently described error-prone form of DNA repair (i.e., repair of DNA double-strand breaks by templated nucleotide sequence insertions derived from distant regions of the genome) not only causes human genetic disease but also impacts on human genome evolution. Our findings illustrate the importance of precise CNV breakpoint delineation for understanding the underlying mutational mechanisms and have implications for primer design in relation to the detection of deletion CNVs in clinical diagnosis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.