The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Tetraodon nigroviridis is a freshwater puffer fish with the smallest known vertebrate genome. Here, we report a draft genome sequence with long-range linkage and substantial anchoring to the 21 Tetraodon chromosomes. Genome analysis provides a greatly improved fish gene catalogue, including identifying key genes previously thought to be absent in fish. Comparison with other vertebrates and a urochordate indicates that fish proteins have diverged markedly faster than their mammalian homologues. Comparison with the human genome suggests ,900 previously unannotated human genes. Analysis of the Tetraodon and human genomes shows that whole-genome duplication occurred in the teleost fish lineage, subsequent to its divergence from mammals. The analysis also makes it possible to infer the basic structure of the ancestral bony vertebrate genome, which was composed of 12 chromosomes, and to reconstruct much of the evolutionary history of ancient and recent chromosome rearrangements leading to the modern human karyotype.Access to entire genome sequences is revolutionizing our understanding of how genetic information is stored and organized in DNA, and how it has evolved over time. The sequence of a genome provides exquisite detail of the gene catalogue within a species, and the recent analysis of near-complete genome sequences of three mammals (human 1 , mouse 2 and rat 3 ) shows the acceleration in the search for causal links between genotype and phenotype, which can then be related to physiological, ecological and evolutionary observations. The partial sequence of the compact puffer fish Takifugu rubripes genome was obtained recently and this survey provided a preliminary catalogue of fish genes 4 . However, the Takifugu assembly is highly fragmented and as a result important questions could not be addressed.Here, we describe and analyse the genome sequence of the freshwater puffer fish Tetraodon nigroviridis with long-range linkage and extensive anchoring to chromosomes. Tetraodon resembles Takifugu in that it possesses one of the smallest known vertebrate genomes, but as a popular aquarium fish it is readily available and is easily maintained in tap water (see Supplementary Notes for naming conventions, natural habitat and phylogeny). The two puffer fish diverged from a common ancestor between 18-30 million years (Myr) ago and from the common ancestor with mammals about 450 Myr ago 5 . This long evolutionary distance provides a good contrast to distinguish conserved features from neutrally evolving DNA by sequence comparison. Tetraodon sequences in fact had an important role in providing a reliable estimate of the number of genes in the human genome 6 . There has been a vigorous and unresolved debate as to whether a whole-genome duplication (WGD) occurred in the ray-finned fish (actinopterygians) lineage after its separation from tetrapods [7][8][9] . By exploiting the extensive anchoring of the Tetraodon sequence to chromosomes, we provide a definitive answer to this question. The distribution of duplicated genes in t...
Ralstonia solanacearum is a devastating, soil-borne plant pathogen with a global distribution and an unusually wide host range. It is a model system for the dissection of molecular determinants governing pathogenicity. We present here the complete genome sequence and its analysis of strain GMI1000. The 5.8-megabase (Mb) genome is organized into two replicons: a 3.7-Mb chromosome and a 2.1-Mb megaplasmid. Both replicons have a mosaic structure providing evidence for the acquisition of genes through horizontal gene transfer. Regions containing genetically mobile elements associated with the percentage of G+C bias may have an important function in genome evolution. The genome encodes many proteins potentially associated with a role in pathogenicity. In particular, many putative attachment factors were identified. The complete repertoire of type III secreted effector proteins can be studied. Over 40 candidates were identified. Comparison with other genomes suggests that bacterial plant pathogens and animal pathogens harbour distinct arrays of specialized type III-dependent effectors.
ATP-binding cassette (ABC) systems, also called traffic ATPases, are found in eukaryotes and prokaryotes and almost all participate in the transport of a wide variety of molecules. ABC systems are characterized by a highly conserved ATPase module called here the ABC module, involved in coupling transport to ATP hydrolysis. We have used the sequence of one of the first representatives of bacterial ABC transporters, the MalK protein, to collect 250 closely related sequences from a nonredundant protein sequence database. The sequences collected by this objective method are all known or putative ABC transporters. After having eliminated short protein sequences and duplicates, the 197 remaining sequences were subjected to a phylogenetic analysis based on a mutational similarity matrix. An unrooted tree for these modules was found to display two major branches, one grouping all collected uptake systems and the other all collected export systems. This remarkable disposition strongly suggests that the divergence between these two functionally different types of ABC systems occurred once in the history of these systems and probably before the differentiation of prokaryotes and eukaryotes. We discuss the implications of this finding and we propose a model accounting for the generation and the diversification of ABC systems.
Despite a rapid increase in the amount of available archaeal sequence information, little is known about the duplication of genetic material in the third domain of life. We identified a single origin of bidirectional replication in Pyrococcus abyssi by means of in silico analyses of cumulative oligomer skew and the identification of an early replicating chromosomal segment. The replication origin in three Pyrococcus species was found to be highly conserved, and several eukaryotic-like DNA replication genes were clustered around it. As in Bacteria, the chromosomal region containing the replication terminus was a hot spot of genome shuffling. Thus, although bacterial and archaeal replication proteins differ profoundly, they are used to replicate chromosomes in a similar manner in both prokaryotic domains.
Retroviral vectors have induced subtle clonal skewing in many gene therapy patients and severe clonal proliferation and leukemia in some of them, emphasizing the need for comprehensive integration site analyses to assess the biosafety and genomic pharmacokinetics of vectors and clonal fate of gene-modified cells in vivo. Integration site analyses such as linear amplification-mediated PCR (LAM-PCR) require a restriction digest generating unevenly small fragments of the genome. Here we show that each restriction motif allows for identification of only a fraction of all genomic integrants, hampering the understanding and prediction of biological consequences after vector insertion. We developed a model to define genomic access to the viral integration site that provides optimal restriction motif combinations and minimizes the percentage of nonaccessible insertion loci. We introduce a new nonrestrictive LAM-PCR approach that has superior capabilities for comprehensive unbiased integration site retrieval in preclinical and clinical samples independent of restriction motifs and amplification inefficiency.
The number of genes in the human genome is unknown, with estimates ranging from 50,000 to 90,000 (refs 1, 2), and to more than 140,000 according to unpublished sources. We have developed 'Exofish', a procedure based on homology searches, to identify human genes quickly and reliably. This method relies on the sequence of another vertebrate, the pufferfish Tetraodon nigroviridis, to detect conserved sequences with a very low background. Similar to Fugu rubripes, a marine pufferfish proposed by Brenner et al. as a model for genomic studies, T. nigroviridis is a more practical alternative with a genome also eight times more compact than that of human. Many comparisons have been made between F. rubripes and human DNA that demonstrate the potential of comparative genomics using the pufferfish genome. Application of Exofish to the December version of the working draft sequence of the human genome and to Unigene showed that the human genome contains 28,000-34,000 genes, and that Unigene contains less than 40% of the protein-coding fraction of the human genome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.