BackgroundStructural variants (SVs) are less common than single nucleotide polymorphisms and indels in the population, but collectively account for a significant fraction of genetic polymorphism and diseases. Base pair differences arising from SVs are on a much higher order (>100 fold) than point mutations; however, none of the current detection methods are comprehensive, and currently available methodologies are incapable of providing sufficient resolution and unambiguous information across complex regions in the human genome. To address these challenges, we applied a high-throughput, cost-effective genome mapping technology to comprehensively discover genome-wide SVs and characterize complex regions of the YH genome using long single molecules (>150 kb) in a global fashion.ResultsUtilizing nanochannel-based genome mapping technology, we obtained 708 insertions/deletions and 17 inversions larger than 1 kb. Excluding the 59 SVs (54 insertions/deletions, 5 inversions) that overlap with N-base gaps in the reference assembly hg19, 666 non-gap SVs remained, and 396 of them (60%) were verified by paired-end data from whole-genome sequencing-based re-sequencing or de novo assembly sequence from fosmid data. Of the remaining 270 SVs, 260 are insertions and 213 overlap known SVs in the Database of Genomic Variants. Overall, 609 out of 666 (90%) variants were supported by experimental orthogonal methods or historical evidence in public databases. At the same time, genome mapping also provides valuable information for complex regions with haplotypes in a straightforward fashion. In addition, with long single-molecule labeling patterns, exogenous viral sequences were mapped on a whole-genome scale, and sample heterogeneity was analyzed at a new level.ConclusionOur study highlights genome mapping technology as a comprehensive and cost-effective method for detecting structural variation and studying complex regions in the human genome, as well as deciphering viral integration into the host genome.Electronic supplementary materialThe online version of this article (doi:10.1186/2047-217X-3-34) contains supplementary material, which is available to authorized users.
The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
A certain amount of Na2S2O3·5H2O solution added to the solution containing cadmium ions to form Cd/CdS photocatalysts could remove cadmium ions and produce hydrogen efficiently under visible light irradiation.
Cutin and wax are the main precursors of the cuticle that covers the aerial parts of plants and provide protection against biotic and abiotic stresses. Long-chain acyl-CoA synthetases (LACSs) play diversified roles in the synthesis of cutin, wax, and triacylglycerol (TAG). Most of the information concerned with LACS functions is obtained from model plants, whereas the roles of LACS genes in Glycine max are less known. Here, we have identified 19 LACS genes in Glycine max, an important crop plant, and further focused our attention on 4 LACS2 genes (named as GmLACS2-1, 2, 3, 4, respectively). These GmLACS2 genes display different expression patterns in various organs and also show different responses to abiotic stresses, implying that these genes might play diversified functions during plant growth and against stresses. To further identify the role of GmLACS2-3, greatly induced by abiotic stresses, we transformed a construct containing its full length of coding sequence into Arabidopsis. The expression of GmLACS2-3 in an Arabidopsis atlacs2 mutant greatly suppressed its phenotype, suggesting it plays conserved roles with that of AtLACS2. The overexpression of GmLACS2-3 in wild-type plants significantly increased the amounts of cutin and suberin but had little effect on wax amounts, indicating the specific role of GmLACS2-3 in the synthesis of cutin and suberin. In addition, these GmLACS2-3 overexpressing plants showed enhanced drought tolerance. Taken together, our study deepens our understanding of the functions of LACS genes in different plants and also provides a clue for cultivating crops with strong drought resistance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.