Arina D. Omer scite author profile

Summary We use in situ Hi-C to probe the three-dimensional architecture of genomes, constructing haploid and diploid maps of nine cell types. The densest, in human lymphoblastoid cells, contains 4.9 billion contacts, achieving 1-kilobase resolution. We find that genomes are partitioned into local domains, which are associated with distinct patterns of histone marks and segregate into six subcompartments. We identify ~10,000 loops. These loops frequently link promoters and enhancers, correlate with gene activation, and show conservation across cell types and species. Loop anchors typically occur at domain boundaries and bind CTCF. CTCF sites at loop anchors occur predominantly (>90%) in a convergent orientation, with the asymmetric motifs ‘facing’ one another. The inactive X-chromosome splits into two massive domains and contains large loops anchored at CTCF-binding repeats.

show abstract

A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping

Rao¹,

Huntley²,

Durand³

et al. 2015

Cell

1,025

177

2,666

View full text Add to dashboard Cite

Cell 159, 1665Cell 159, -1680 December 11, 2014) Our paper analyzed the three-dimensional (3D) architecture of genomes at high resolution in nine human and murine cell lines. One of our main conclusions was that the vast majority of loops are anchored at CTCF/cohesin-binding sites whose motifs are oriented in a convergent fashion, i.e., the motifs point at one another. We arrived at this conclusion by analyzing peaks where the two corresponding peak loci each contained a single CTCF-binding motif. We performed this analysis in eight different cell lines.It has come to our attention that, in this analysis, we inadvertently used the wrong peak file for one of the eight cell lines (GM12878). In addition to peaks in which there was a unique motif at each of the two peak loci, this file, which had been generated by a preliminary version of our code, included peaks whenever there was (1) a unique motif at one peak locus and (2) a unique motif on the opposite strand at the other peak locus. We have now redone the analysis using the correct file. As a result, we found that several numbers on page 1675 of the main text and page S73 of the Extended Experimental Procedures, as well as Figure 6D, need to be adjusted as shown below. The correct list of motifs associated with each loop anchor, together with their orientations, has been uploaded to the Gene Expression Omnibus (GEO) at the original accession number for the paper, GEO: GSE63525.These corrections do not affect the numbers for the other seven cell lines and do not modify the conclusions of the paper in any way.The main text corrections from page 1675 are shown below, with the correct numbers underlined and the original text numbers in brackets.''If CTCF sites were randomly oriented, one would expect all four orientations to occur equally often. But when we examined the 2,857 [4,322] peaks in GM12878 where the two corresponding peak loci each contained a single CTCF-binding motif, we found that the vast majority (90% [92%]) of motif pairs are convergent ( Figures 6D and 6E). Overall, the presence, at pairs of peak loci, of bound CTCF sites in the convergent orientation was enriched 102-fold over random expectation (Extended Experimental Procedures). The convergent orientation was overwhelmingly more frequent than the divergent orientation, despite the fact that divergent motifs also lie on opposing strands: in GM12878, the counts were 2,574-10 [3,971-78] (257-fold [51-fold] enrichment, convergent versus divergent); in IMR90, 1, in HMEC,; in K562, 723-2 (362-fold); in HUVEC, 671-4 (168-fold); in HeLa, 301-3 (100-fold); in NHEK, 556-9 (62-fold); and in CH12-LX, 625-8 (78-fold). This pattern suggests that a pair of CTCF sites in the convergent orientation is required for the formation of a loop.The observation that looped CTCF sites occur in the convergent orientation also allows us to analyze peak loci containing multiple CTCF-bound motifs to predict which motif instance plays a role in a given loop. In this way, we can associate nearly two-thirds of peak loci (8...

show abstract

Cohesin Loss Eliminates All Loop Domains

Rao

Huang

Hilaire

et al. 2017

Cell

1,529

223

1,924

View full text Add to dashboard Cite

SUMMARY The human genome folds to create thousands of intervals, called “contact domains,” that exhibit enhanced contact frequency within themselves. “Loop domains” form because of tethering between two loci – almost always bound by CTCF and cohesin – lying on the same chromosome. “Compartment domains” form when genomic intervals with similar histone marks co-segregate. Here, we explore the effects of degrading cohesin. All loop domains are eliminated, but neither compartment domains nor histone marks are affected. Loss of loop domains does not lead to widespread ectopic gene activation, but does affect a significant minority of active genes. In particular, cohesin loss causes superenhancers to co-localize, forming hundreds of links within and across chromosomes, and affecting the regulation of nearby genes. We then restore cohesin and monitor the re-formation of each loop. Although reformation rates vary greatly, many megabase-sized loops recovered in under an hour, consistent with a model where loop extrusion is rapid.

show abstract

De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds

Dudchenko

Batra

Omer

et al. 2017

Science

1,686

1,645

View full text Add to dashboard Cite

The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67× coverage). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Ae. aegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that almost all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, and accurate, and can be applied to many species.

show abstract

Improved reference genome of Aedes aegypti informs arbovirus vector control

Matthews

Dudchenko

Kingan

et al. 2018

Nature

462

621

View full text Add to dashboard Cite

Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector.

show abstract

Genome sequence of Halobacterium species NRC-1

Kennedy²,

Mahairas

et al. 2000

Proc. Natl. Acad. Sci. U.S.A.

665

548

View full text Add to dashboard Cite

We report the complete sequence of an extreme halophile, Halobacterium sp. NRC-1, harboring a dynamic 2,571,010-bp genome containing 91 insertion sequences representing 12 families and organized into a large chromosome and 2 related minichromosomes. The Halobacterium NRC-1 genome codes for 2,630 predicted proteins, 36% of which are unrelated to any previously reported. Analysis of the genome sequence shows the presence of pathways for uptake and utilization of amino acids, active sodiumproton antiporter and potassium uptake systems, sophisticated photosensory and signal transduction pathways, and DNA replication, transcription, and translation systems resembling more complex eukaryotic organisms. Whole proteome comparisons show the definite archaeal nature of this halophile with additional similarities to the Gram-positive Bacillus subtilis and other bacteria. The ease of culturing Halobacterium and the availability of methods for its genetic manipulation in the laboratory, including construction of gene knockouts and replacements, indicate this halophile can serve as an excellent model system among the archaea.

show abstract

Homologs of Small Nucleolar RNAs in Archaea

Omer

Lowe

Russell

et al. 2000

Science

313

304

View full text Add to dashboard Cite

In eukaryotes, dozens of posttranscriptional modifications are directed to specific nucleotides in ribosomal RNAs (rRNAs) by small nucleolar RNAs (snoRNAs). We identified homologs of snoRNA genes in both branches of the Archaea. Eighteen small sno-like RNAs (sRNAs) were cloned from the archaeon Sulfolobus acidocaldarius by coimmunoprecipitation with archaeal fibrillarin and NOP56, the homologs of eukaryotic snoRNA-associated proteins. We trained a probabilistic model on these sRNAs to search for more sRNAs in archaeal genomic sequences. Over 200 additional sRNAs were identified in seven archaeal genomes representing both the Crenarchaeota and the Euryarchaeota. snoRNA-based rRNA processing was therefore probably present in the last common ancestor of Archaea and Eukarya, predating the evolution of a morphologically distinct nucleolus.

show abstract

The Juicebox Assembly Tools module facilitatesde novoassembly of mammalian genomes with chromosome-length scaffolds for under $1000

Dudchenko

Shamim

Batra

et al. 2018

Preprint

268

275

View full text Add to dashboard Cite

Hi-C contact maps are valuable for genome assembly (Lieberman-Aiden, van Berkum et al. 2009; Burton et al. 2013; Dudchenko et al. 2017). Recently, we developed Juicebox, a system for the visual exploration of Hi-C data (Durand, Robinson et al. 2016), and 3D-DNA, an automated pipeline for using Hi-C data to assemble genomes (Dudchenko et al. 2017). Here, we introduce “Assembly Tools,” a new module for Juicebox, which provides a point-and-click interface for using Hi-C heatmaps to identify and correct errors in a genome assembly. Together, 3D-DNA and the Juicebox Assembly Tools greatly reduce the cost of accurately assembling complex eukaryotic genomes. To illustrate, we generated de novo assemblies with chromosome-length scaffolds for three mammals: the wombat, Vombatus ursinus (3.3Gb), the Virginia opossum, Didelphis virginiana (3.3Gb), and the raccoon, Procyon lotor (2.5Gb). The only inputs for each assembly were Illumina reads from a short insert DNA-Seq library (300 million Illumina reads, maximum length 2x150 bases) and an in situ Hi-C library (100 million Illumina reads, maximum read length 2x150 bases), which cost <$1000.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Arina D. Omer

A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping

A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping

Cohesin Loss Eliminates All Loop Domains

De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds

Improved reference genome of Aedes aegypti informs arbovirus vector control

Genome sequence of Halobacterium species NRC-1

Homologs of Small Nucleolar RNAs in Archaea

The Juicebox Assembly Tools module facilitatesde novoassembly of mammalian genomes with chromosome-length scaffolds for under $1000

Contact Info

Product

Resources

About