Bacillus subtilis is the best-characterized member of the Gram-positive bacteria. Its genome of 4,214,810 base pairs comprises 4,100 protein-coding genes. Of these protein-coding genes, 53% are represented once, while a quarter of the genome corresponds to several gene families that have been greatly expanded by gene duplication, the largest family containing 77 putative ATP-binding transport proteins. In addition, a large proportion of the genetic capacity is devoted to the utilization of a variety of carbon sources, including many plant-derived molecules. The identification of five signal peptidase genes, as well as several genes for components of the secretion apparatus, is important given the capacity of Bacillus strains to secrete large amounts of industrially important enzymes. Many of the genes are involved in the synthesis of secondary metabolites, including antibiotics, that are more typically associated with Streptomyces species. The genome contains at least ten prophages or remnants of prophages, indicating that bacteriophage infection has played an important evolutionary role in horizontal gene transfer, in particular in the propagation of bacterial pathogenesis.
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
The plant Arabidopsis thaliana (Arabidopsis) has become an important model species for the study of many aspects of plant biology. The relatively small size of the nuclear genome and the availability of extensive physical maps of the five chromosomes provide a feasible basis for initiating sequencing of the five chromosomes. The YAC (yeast artificial chromosome)-based physical map of chromosome 4 was used to construct a sequence-ready map of cosmid and BAC (bacterial artificial chromosome) clones covering a 1.9-megabase (Mb) contiguous region, and the sequence of this region is reported here. Analysis of the sequence revealed an average gene density of one gene every 4.8 kilobases (kb), and 54% of the predicted genes had significant similarity to known genes. Other interesting features were found, such as the sequence of a disease-resistance gene locus, the distribution of retroelements, the frequent occurrence of clustered gene families, and the sequence of several classes of genes not previously encountered in plants.
Southern blot screening of a genomic Helicobacter pylori library was employed to find a P type ATPase using a mixture of 16 DNA oligonucleotides coding for the DKTGT(I/L)T consensus sequence specific for the phosphorylation site of this family of ATPases. A positive clone, pRH439, was isolated and sequenced. The inserted 3.4-kb H. pylori DNA contained an intact open reading frame encoding a protein of 686 amino acids carrying the consensus sites for phosphorylation and ATP binding. The amino acid sequence exhibits a 25-30% identity with bacterial Cd2+ and Cu2+ ATPases. Genomic Southern blot analysis showed that this ATPase was present in all H. pylori strains examined, whereas it was not detectable in Campylobacter jejuni and other bacteria. The membrane topology of this ATPase was investigated using in vitro transcription/translation of fusion vectors to find signal anchor and/or stop transfer sequences. Eight regions of the H. pylori ATPase acted as signal anchor and/or stop transfer sequences and were ordered pairwise along the polypeptide chain placing the N and C-terminal amino acids in the cytoplasm. These transmembrane segments are contained between positions 73 and 92 (H1), 98 and 125 (H2), 128 and 148 (H3), 149 and 176 (H4), 309 and 327 (H5), 337 and 371 (H6), 637 and 658 (H7), and 659 and 685 (H8). The membrane domain of the ATPase, therefore, consists of at least four pairs of transmembrane segments with the phosphorylation site and ATP binding domain located in the large cytoplasmic loop between H6 and H7. The cytoplasmic domain contains several histidines and cysteines, perhaps indicative of divalent cation binding sites. There are several charged amino acids (3 Lys, 2 Glu, 2 Asp), predicted to be in the membrane domain mainly in H2, H3, and H4 and a Cys-Pro-Cys putative metal ion site in H6. The extracytoplasmic domain also has several charged amino acids (5 Glu, 1 Asp, 1 Lys, 1 Arg). It is likely that this novel protein is a heavy metal cation transporting ATPase and belongs to a family of P type ATPases containing eight transmembrane segments.
Nuclear DEAD box protein p68 is immunologically related to SV40 large tumour antigen and both proteins possess RNA helicase activity. In this report, we describe the structural organisation of the human p68 gene and aspects of the regulation of its expression. Northern blot and primer extension analyses indicate that, although its level is variable, the p68 RNA helicase appears to be expressed from a single transcription start site in all tissues tested. Sequence analysis revealed that the p68 promoter harbours a 'TATA', a 'CAAT' and an initiator element and contains high affinity binding sites for Sp1, AP-2, CRE and Myc. This and functional promoter analyses in transient expression assays suggest that transcriptional regulation of the p68 gene is complex. Furthermore, there are indications that p68 expression is also regulated post-transcriptionally. Steady-state pools of poly(A)(+)RNA from human cells contain completely spliced p68 mRNA and alternatively spliced forms that contain introns 8-11 or 8-12 (from a total of 12 introns) and are not translated. Analysis of a conditionally p68-overproducing HeLa cell line points to negative autoregulation at the level of splicing, which is confirmed by a recently reported association of p68 with spliceosomes in human cells.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.