Emmanuel Courcelle scite author profile

SUMMARYRhizobium-induced root nodules are specialized organs for symbiotic nitrogen fixation. Indeterminate-type nodules are formed from an apical meristem and exhibit a spatial zonation which corresponds to successive developmental stages. To get a dynamic and integrated view of plant and bacterial gene expression associated with nodule development, we used a sensitive and comprehensive approach based upon oriented high-depth RNA sequencing coupled to laser microdissection of nodule regions. This study, focused on the association between the model legume Medicago truncatula and its symbiont Sinorhizobium meliloti, led to the production of 942 million sequencing read pairs that were unambiguously mapped on plant and bacterial genomes. Bioinformatic and statistical analyses enabled in-depth comparison, at a whole-genome level, of gene expression in specific nodule zones. Previously characterized symbiotic genes displayed the expected spatial pattern of expression, thus validating the robustness of our approach. We illustrate the use of this resource by examining gene expression associated with three essential elements of nodule development, namely meristem activity, cell differentiation and selected signaling processes related to bacterial Nod factors and redox status. We found that transcription factor genes essential for the control of the root apical meristem were also expressed in the nodule meristem, while the plant mRNAs most enriched in nodules compared with roots were mostly associated with zones comprising both plant and bacterial partners. The data, accessible on a dedicated website, represent a rich resource for microbiologists and plant biologists to address a variety of questions of both fundamental and applied interest.

show abstract

The InterPro Database, 2003 brings increased coverage and new features

Mulder

Apweiler²,

Attwood³

et al. 2003

647

382

View full text Add to dashboard Cite

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created in 1999 as a means of amalgamating the major protein signature databases into one comprehensive resource. PROSITE, Pfam, PRINTS, ProDom, SMART and TIGRFAMs have been manually integrated and curated and are available in InterPro for text- and sequence-based searching. The results are provided in a single format that rationalises the results that would be obtained by searching the member databases individually. The latest release of InterPro contains 5629 entries describing 4280 families, 1239 domains, 95 repeats and 15 post-translational modifications. Currently, the combined signatures in InterPro cover more than 74% of all proteins in SWISS-PROT and TrEMBL, an increase of nearly 15% since the inception of InterPro. New features of the database include improved searching capabilities and enhanced graphical user interfaces for visualisation of the data. The database is available via a webserver (http://www.ebi.ac.uk/interpro) and anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).

show abstract

New developments in the InterPro database

Mulder

Apweiler

Attwood

et al. 2007

Nucleic Acids Research

471

368

View full text Add to dashboard Cite

InterPro is an integrated resource for protein families, domains and functional sites, which integrates the following protein signature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The latter two new member databases have been integrated since the last publication in this journal. There have been several new developments in InterPro, including an additional reading field, new database links, extensions to the web interface and additional match XML files. InterPro has always provided matches to UniProtKB proteins on the website and in the match XML file on the FTP site. Additional matches to proteins in UniParc (UniProt archive) are now available for download in the new match XML files only. The latest InterPro release (13.0) contains more than 13 000 entries, covering over 78% of all proteins in UniProtKB. The database is available for text- and sequence-based searches via a webserver (), and for download by anonymous FTP (). The InterProScan search tool is now also available via a web service at .

show abstract

InterPro, progress and status in 2005

Mulder

Apweiler²,

Attwood³

et al. 2004

Nucleic Acids Research

493

343

View full text Add to dashboard Cite

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is provided in an abstract, Gene Ontology mapping and links to specialized databases. New features of InterPro include extended protein match views, taxonomic range information and protein 3D structure data. One of the new match views is the InterPro Domain Architecture view, which shows the domain composition of protein matches. Two new entry types were introduced to better describe InterPro entries: these are active site and binding site. PIRSF and the structure-based SUPERFAMILY are the latest member databases to join InterPro, and CATH and PANTHER are soon to be integrated. InterPro release 8.0 contains 11 007 entries, representing 2573 domains, 8166 families, 201 repeats, 26 active sites, 21 binding sites and 20 post-translational modification sites. InterPro covers over 78% of all proteins in the Swiss-Prot and TrEMBL components of UniProt. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).

show abstract

The ProDom database of protein domain families: more emphasis on 3D

Bru

Courcelle

Carrère

et al. 2004

Nucleic Acids Research

305

213

View full text Add to dashboard Cite

ProDom is a comprehensive database of protein domain families generated from the global comparison of all available protein sequences. Recent improvements include the use of three-dimensional (3D) information from the SCOP database; a completely redesigned web interface (http://www.toulouse.inra.fr/prodom.html); visualization of ProDom domains on 3D structures; coupling of ProDom analysis with the Geno3D homology modelling server; Bayesian inference of evolutionary scenarios for ProDom families. In addition, we have developed ProDom-SG, a ProDom-based server dedicated to the selection of candidate proteins for structural genomics.

show abstract

ProDom: Automated clustering of homologous domains

Servant

Bru²,

Carrère³

et al. 2002

Briefings in Bioinformatics

320

187

View full text Add to dashboard Cite

The ProDom database is a comprehensive set of protein domain families automatically generated from the SWISS-PROT and TrEMBL sequence databases. An associated database, ProDom-CG, has been derived as a restriction of ProDom to completely sequenced genomes. The ProDom construction method is based on iterative PSI-BLAST searches and multiple alignments are generated for each domain family. The ProDom web server provides the user with a set of tools to visualise multiple alignments, phylogenetic trees and domain architectures of proteins, as well as a BLAST-based server to analyse new sequences for homologous domains. The comprehensive nature of ProDom makes it particularly useful to help sustain the growth of InterPro.

show abstract

Reconstruction of monocotelydoneous proto-chromosomes reveals faster evolution in plants than in animals

Salse

Abrouk

Bolot

et al. 2009

Proc. Natl. Acad. Sci. U.S.A.

135

146

View full text Add to dashboard Cite

Paleogenomics seeks to reconstruct ancestral genomes from the genes of today's species. The characterization of paleo-duplications represented by 11,737 orthologs and 4,382 paralogs identified in five species belonging to three of the agronomically most important subfamilies of grasses, that is, Ehrhartoideae (rice) Panicoideae (sorghum, maize), and Pooideae (wheat, barley), permitted us to propose a model for an ancestral genome with a minimal size of 33.6 Mb structured in five proto-chromosomes containing at least 9,138 predicted proto-genes. It appears that only four major evolutionary shuffling events (␣, ␤, ␥, and ␦) explain the divergence of these five cereal genomes during their evolution from a common paleo-ancestor. Comparative analysis of ancestral gene function with rice as a reference indicated that five categories of genes were preferentially modified during evolution. Furthermore, alignments between the five grass proto-chromosomes and the recently identified seven eudicot proto-chromosomes indicated that additional very active episodes of genome rearrangements and gene mobility occurred during angiosperm evolution. If one compares the pace of primate evolution of 90 million years (233 species) to 60 million years of the Poaceae (10,000 species), change in chromosome structure through speciation has accelerated significantly in plants.grasses ͉ paleogenomics P aleogenomics, the study of ancestral genome structures, allows the identification and characterization of mechanisms (e.g., duplications, translocations, and inversions) that have shaped genome species during their evolution and provides a framework to better integrate results from genetics, genomics, and comparative analyses. Studies of fossils and lower taxa organisms [Neanderthal (1), Echinoderms (2), Mammoth (3), Sponge (4), and Moss (5)] have yielded unprecedented information on the evolution of animal species and the relationships between them. When fossil DNA is not available, paleogenomics can be performed through largescale comparative analyses of actual species and through ancestor modeling.In silico colinearity studies and ancestral genome reconstruction in mammals have been facilitated by a generally moderate reshuffling of chromosomal segments since their divergence from a common ancestor Ϸ130 million years ago (mya) (6-9). Recently, Nakatani et al. (10) provided an integrated view of vertebrate paleogenomics with an ancestor of 10 to 13 proto-chromosomes. In contrast to mammals, paleogenomics has been poorly investigated in plants as angiosperm species have undergone serial whole genome or segmental duplications, diploidization, small-scale rearrangements (translocations, gene conversions), and gene copying events that make comparative studies between and within the monocotyledon (mainly grasses) and eudicot families very challenging. For the eudicots, two scenarios based on comparisons between the grape, Arabidopsis thaliana, and poplar genome sequences have been proposed recently. In the first one, the eudicots were proposed t...

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.