The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
The TRANSFAC® database on transcription factors, their binding sites, nucleotide distribution matrices and regulated genes as well as the complementing database TRANSCompel® on composite elements have been further enhanced on various levels. A new web interface with different search options and integrated versions of Match™ and Patch™ provides increased functionality for TRANSFAC®. The list of databases which are linked to the common GENE table of TRANSFAC® and TRANSCompel® has been extended by: Ensembl, UniGene, EntrezGene, HumanPSD™ and TRANSPRO™. Standard gene names from HGNC, MGI and RGD, are included for human, mouse and rat genes, respectively. With the help of InterProScan, Pfam, SMART and PROSITE domains are assigned automatically to the protein sequences of the transcription factors. TRANSCompel® contains now, in addition to the COMPEL table, a separate table for detailed information on the experimental EVIDENCE on which the composite elements are based. Finally, for TRANSFAC®, in respect of data growth, in particular the gain of Drosophila transcription factor binding sites (by courtesy of the Drosophila DNase I footprint database) and of Arabidopsis factors (by courtesy of DATF, Database of Arabidopsis Transcription Factors) has to be stressed. The here described public releases, TRANSFAC® 7.0 and TRANSCompel® 7.0, are accessible under .
The TRANSFAC database on eukaryotic transcriptional regulation, comprising data on transcription factors, their target genes and regulatory binding sites, has been extended and further developed, both in number of entries and in the scope and structure of the collected data. Structured fields for expression patterns have been introduced for transcription factors from human and mouse, using the CYTOMER database on anatomical structures and developmental stages. The functionality of Match, a tool for matrix-based search of transcription factor binding sites, has been enhanced. For instance, the program now comes along with a number of tissue-(or state-)specific profiles and new profiles can be created and modified with Match Profiler. The GENE table was extended and gained in importance, containing amongst others links to LocusLink, RefSeq and OMIM now. Further, (direct) links between factor and target gene on one hand and between gene and encoded factor on the other hand were introduced. The TRANSFAC public release is available at http://www.gene-regulation.com. For yeast an additional release including the latest data was made available separately as TRANSFAC Saccharomyces Module (TSM) at http://transfac.gbf.de. For CYTOMER free download versions are available at http://www.biobase.de:8080/index.html.
Chromosome 21 is the smallest human autosome. An extra copy of chromosome 21 causes Down syndrome, the most frequent genetic cause of significant mental retardation, which affects up to 1 in 700 live births. Several anonymous loci for monogenic disorders and predispositions for common complex disorders have also been mapped to this chromosome, and loss of heterozygosity has been observed in regions associated with solid tumours. Here we report the sequence and gene catalogue of the long arm of chromosome 21. We have sequenced 33,546,361 base pairs (bp) of DNA with very high accuracy, the largest contig being 25,491,867 bp. Only three small clone gaps and seven sequencing gaps remain, comprising about 100 kilobases. Thus, we achieved 99.7% coverage of 21q. We also sequenced 281,116 bp from the short arm. The structural features identified include duplications that are probably involved in chromosomal abnormalities and repeat structures in the telomeric and pericentromeric regions. Analysis of the chromosome revealed 127 known genes, 98 predicted genes and 59 pseudogenes.
Sigma factors are essential global regulators of transcription initiation in bacteria which confer promoter recognition specificity to the RNA polymerase core enzyme. They provide effective mechanisms for simultaneously regulating expression of large numbers of genes in response to challenging conditions, and their presence has been linked to bacterial virulence and pathogenicity. In this study, we constructed nine his-tagged sigma factor expressing and/or deletion mutant strains in the opportunistic pathogen Pseudomonas aeruginosa. To uncover the direct and indirect sigma factor regulons, we performed mRNA profiling, as well as chromatin immunoprecipitation coupled to high-throughput sequencing. We furthermore elucidated the de novo binding motif of each sigma factor, and validated the RNA- and ChIP-seq results by global motif searches in the proximity of transcriptional start sites (TSS). Our integrated approach revealed a highly modular network architecture which is composed of insulated functional sigma factor modules. Analysis of the interconnectivity of the various sigma factor networks uncovered a limited, but highly function-specific, crosstalk which orchestrates complex cellular processes. Our data indicate that the modular structure of sigma factor networks enables P. aeruginosa to function adequately in its environment and at the same time is exploited to build up higher-level functions by specific interconnections that are dominated by a participation of RpoN.
Phenotypic variability among bacteria depends on gene expression in response to different environments, and it also reflects differences in genomic structure. In this study, we analyzed transcriptome sequencing (RNA-seq) profiles of 151 Pseudomonas aeruginosa clinical isolates under standard laboratory conditions and of one P. aeruginosa type strain under 14 different environmental conditions. Our approach allowed dissection of the impact of the genetic background versus environmental cues on P. aeruginosa gene expression profiles and revealed that phenotypic variation was larger in response to changing environments than between genomically different isolates. We demonstrate that mutations within the global regulator LasR affect more than one trait (pleiotropy) and that the interaction between mutations (epistasis) shapes the P. aeruginosa phenotypic plasticity landscape. Because of pleiotropic and epistatic effects, average genotype and phenotype measures appeared to be uncorrelated in P. aeruginosa.
In T-cell acute lymphoblastic leukemia, alternative t(5;14)-(q35;q32.2) forms effect dysregulation of either TLX3 or NKX2-5 homeobox genes at 5q35 by juxtaposition with 14q32.2 breakpoints dispersed across the BCL11B downstream genomic desert. Leukemic gene dysregulation by t(5;14) was investigated by DNA inhibitory treatments with 26-mer double-stranded DNA oligonucleotides directed against candidate enhancers at, or near, orphan T-cell DNase I hypersensitive sites located between 3 ¶-BCL11B and VRK1. NKX2-5 down-regulation in t(5;14) PEER cells was almost entirely restricted to DNA inhibitory treatment targeting enhancers within the distal breakpoint cluster region and was dose and sequence dependent, whereas enhancers near 3 ¶-BCL11B regulated that gene only. Chromatin immunoprecipitation assays showed that the four most effectual NKX2-5 ectopic enhancers were hyperacetylated. These enhancers clustered f1 Mbp downstream of BCL11B, within a region displaying multiple regulatory stigmata, including a TCRA enhancer motif, deep sequence conservation, and tight nuclear matrix attachment relaxed by trichostatin A treatment. Intriguingly, although TLX3/NKX2-5 promoter/exon 1 regions were hypoacetylated, their expression was trichostatin A sensitive, implying extrinsic regulation by factor(s) under acetylation control. Knockdown of PU.1, known to be trichostatin A responsive and which potentially binds TLX3/NKX2-5 promoters, effected down-regulation of both homeobox genes. Moreover, genomic analysis showed preferential enrichment near ectopic enhancers of binding sites for the PU.1 cofactor HMGA1, the knockdown of which also inhibited NKX2-5. We suggest that HMGA1 and PU
Extensive use of next-generation sequencing (NGS) for pathogen profiling has the potential to transform our understanding of how genomic plasticity contributes to phenotypic versatility. However, the storage of large amounts of NGS data and visualization tools need to evolve to offer the scientific community fast and convenient access to these data. We introduce BACTOME as a database system that links aligned DNA- and RNA-sequencing reads of clinical Pseudomonas aeruginosa isolates with clinically relevant pathogen phenotypes. The database allows data extraction for any single isolate, gene or phenotype as well as data filtering and phenotypic grouping for specific research questions. With the integration of statistical tools we illustrate the usefulness of a relational database structure for the identification of phenotype–genotype correlations as an essential part of the discovery pipeline in genomic research. Furthermore, the database provides a compilation of DNA sequences and gene expression values of a plethora of clinical isolates to give a consensus DNA sequence and consensus gene expression signature. Deviations from the consensus thereby describe the genomic landscape and the transcriptional plasticity of the species P. aeruginosa. The database is available at https://bactome.helmholtz-hzi.de.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.