Next Generation Sequencing (NGS) technology is based on cutting DNA into small fragments, and their massive parallel sequencing. The multiple overlapping segments termed “reads” are assembled into a contiguous sequence. To reduce sequencing errors, every genome region should be sequenced several dozen times. This sequencing approach is based on the assumption that genomic DNA breaks are random and sequence-independent. However, previously we showed that for the sonicated restriction DNA fragments the rates of double-stranded breaks depend on the nucleotide sequence. In this work we analyzed genomic reads from NGS data and discovered that fragmentation methods based on the action of the hydrodynamic forces on DNA, produce similar bias. Consideration of this non-random DNA fragmentation may allow one to unravel what factors and to what extent influence the non-uniform coverage of various genomic regions.
Distribution of electrostatic potential of DNA fragments was evaluated. A method for calculation of electrostatic potential distribution based on Coulomb's law is proposed for long DNA fragments (approximately 1000 nucleotide pairs). For short DNA sequences, this technique provides a good correlation with the results obtained using Poisson-Boltzmann equation thus justifying its application in comparative studies for long DNA fragments. Calculation was performed for several DNA fragments from E. coli and bacteriophage T7 genomes containing promoter and nonpromoter regions. The results obtained indicate that coding regions are characterized by more homogeneous distribution of electrostatic potential whereas local inhomogeneity of DNA electrostatic profile is typical for promoter regions. The possible role of electrostatic interactions in RNA polymerase-promoter recognition is discussed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.