ArrayExpress http://www.ebi.ac.uk/arrayexpress consists of three components: the ArrayExpress Repository—a public archive of functional genomics experiments and supporting data, the ArrayExpress Warehouse—a database of gene expression profiles and other bio-measurements and the ArrayExpress Atlas—a new summary database and meta-analytical tool of ranked gene expression across multiple experiments and different biological conditions. The Repository contains data from over 6000 experiments comprising approximately 200 000 assays, and the database doubles in size every 15 months. The majority of the data are array based, but other data types are included, most recently—ultra high-throughput sequencing transcriptomics and epigenetic data. The Warehouse and Atlas allow users to query for differentially expressed genes by gene names and properties, experimental conditions and sample properties, or a combination of both. In this update, we describe the ArrayExpress developments over the last two years.
Summary• One approach for investigating the molecular basis of wood formation is to integrate microarray profiling data sets and sequence analyses, comparing tree species with model plants such as Arabidopsis. Conifers may be included in comparative studies thanks to large-scale expressed sequence tag (EST) analyses, which enable the development of cDNA microarrays with very significant genome coverage.• A microarray of 10 400 low-redundancy sequences was designed starting from white spruce (Picea glauca (Moench.) Voss) cDNAs. Computational procedures that were developed to ensure broad transcriptome coverage and efficient PCR amplification were used to select cDNA clones, which were re-sequenced in the microarray manufacture process.• White spruce transcript profiling experiments that compared secondary xylem to phloem and needles identified 360 xylem-preferential gene sequences. The functional annotations of all differentially expressed sequences were highly consistent with the results of similar analyses carried out in angiosperm trees and herbaceous plants.• Computational analyses comparing the spruce microarray sequences and core xylem gene sets from Arabidopsis identified 31 transcripts that were highly conserved in angiosperms and gymnosperms, in terms of both sequence and xylem expression. Several other spruce sequences have not previously been linked to xylem differentiation (including genes encoding TUBBY-like domain proteins (TLPs) and a gibberellin insensitive (gai) gene sequence) or were shown to encode proteins of unknown function encompassing diverse conserved domains of unknown function.
In the major animal model species like mouse, fish or fly, detailed spatial information on gene expression over time can be acquired through whole mount in situ hybridization experiments. In these species, expression patterns of many genes have been studied and data has been integrated into dedicated model organism databases like ZFIN for zebrafish, MEPD for medaka, BDGP for Drosophila or GXD for mouse. However, a central repository that allows users to query and compare gene expression patterns across different species has not yet been established. Therefore, we have integrated expression patterns for zebrafish, Drosophila, medaka and mouse into a central public repository called 4DXpress (expression database in four dimensions). Users can query anatomy ontology-based expression annotations across species and quickly jump from one gene to the orthologues in other species. Genes are linked to public microarray data in ArrayExpress. We have mapped developmental stages between the species to be able to compare developmental time phases. We store the largest collection of gene expression patterns available to date in an individual resource, reflecting 16 505 annotated genes. 4DXpress will be an invaluable tool for developmental as well as for computational biologists interested in gene regulation and evolution. 4DXpress is available at http://ani.embl.de/4DXpress.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.