Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.
Drosophila melanogaster polytene chromosomes display specific banding pattern; the underlying genetic organization of this pattern has remained elusive for many years. In the present paper, we analyze 32 cytology-mapped polytene chromosome interbands. We estimated molecular locations of these interbands, described their molecular and genetic organization and demonstrate that polytene chromosome interbands contain the 5′ ends of housekeeping genes. As a rule, interbands display preferential “head-to-head” orientation of genes. They are enriched for “broad” class promoters characteristic of housekeeping genes and associate with open chromatin proteins and Origin Recognition Complex (ORC) components. In two regions, 10A and 100B, coding sequences of genes whose 5′-ends reside in interbands map to constantly loosely compacted, early-replicating, so-called “grey” bands. Comparison of expression patterns of genes mapping to late-replicating dense bands vs genes whose promoter regions map to interbands shows that the former are generally tissue-specific, whereas the latter are represented by ubiquitously active genes. Analysis of RNA-seq data (modENCODE-FlyBase) indicates that transcripts from interband-mapping genes are present in most tissues and cell lines studied, across most developmental stages and upon various treatment conditions. We developed a special algorithm to computationally process protein localization data generated by the modENCODE project and show that Drosophila genome has about 5700 sites that demonstrate all the features shared by the interbands cytologically mapped to date.
Salivary gland polytene chromosomes of Drosophila melanogaster have a reproducible set of intercalary heterochromatin (IH) sites, characterized by late DNA replication, underreplicated DNA, breaks and frequent ectopic contacts. The SuUR mutation has been shown to suppress underreplication, and wild-type SuUR protein is found at late-replicating IH sites and in pericentric heterochromatin. Here we show that the SuUR gene influences all four IH features. The SuUR mutation leads to earlier completion of DNA replication. Using transgenic strains with two, four or six additional SuUR(+) doses (4-8xSuUR(+)) we show that wild-type SuUR is an enhancer of DNA underreplication, causing many late-replicating sites to become underreplicated. We map the underreplication sites and show that their number increases from 58 in normal strains (2xSuUR(+)) to 161 in 4-8xSuUR(+) strains. In one of these new sites (1AB) DNA polytenization decreases from 100% in the wild type to 51%-85% in the 4xSuUR (+) strain. In the 4xSuUR(+) strain, 60% of the weak points coincide with the localization of Polycomb group (PcG) proteins. At the IH region 89E1-4 (the Bithorax complex), a typical underreplication site, the degree of underreplication increases with four doses of SuUR(+) but the extent of the underreplicated region is the same as in wild type and corresponds to the region containing PcG binding sites. We conclude that the polytene chromosome regions known as IH are binding sites for SuUR protein and in many cases PcG silencing proteins. We propose that these stable silenced regions are late replicated and, in the presence of SuUR protein, become underreplicated.
In Drosophila, dosage compensation requires assembly of the Male Specific Lethal (MSL) protein complex for doubling transcription of most X-linked genes in males. The recognition of the X chromosome by the MSL complex has been suggested to include initial assembly at approximately 35 chromatin entry sites and subsequent spreading of mature complexes in cis to numerous additional sites along the chromosome. To understand this process further we examined MSL patterns in a range of wild-type and mutant backgrounds producing different amounts of MSL components. Our data support a model in which MSL complex binding to the X is directed by a hierarchy of target sites that display different affinities for the MSL proteins. Chromatin entry sites differ in their ability to provide local intensive binding of complexes to adjacent regions, and need high MSL complex titers to achieve this. We also mapped a set of definite autosomal regions (approximately 70) competent to associate with the functional MSL complex in wild-type males. Overexpression of both MSL1 and MSL2 stabilizes this binding and results in inappropriate MSL binding to the chromocenter and the 4th chromosome. Thus, wild-type MSL complex titers are critical for correct targeting to the X chromosome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.