Protein interaction mapping using large-scale two-hybrid analysis has been proposed as a way to functionally annotate large numbers of uncharacterized proteins predicted by complete genome sequences. This approach was examined in Caenorhabditis elegans, starting with 27 proteins involved in vulval development. The resulting map reveals both known and new potential interactions and provides a functional annotation for approximately 100 uncharacterized gene products. A protein interaction mapping project is now feasible for C. elegans on a genome-wide scale and should contribute to the understanding of molecular mechanisms in this organism and in human diseases.
To verify the genome annotation and to create a resource to functionally characterize the proteome, we attempted to Gateway-clone all predicted protein-encoding open reading frames (ORFs), or the 'ORFeome,' of Caenorhabditis elegans. We successfully cloned approximately 12,000 ORFs (ORFeome 1.1), of which roughly 4,000 correspond to genes that are untouched by any cDNA or expressed-sequence tag (EST). More than 50% of predicted genes needed corrections in their intron-exon structures. Notably, approximately 11,000 C. elegans proteins can now be expressed under many conditions and characterized using various high-throughput strategies, including large-scale interactome mapping. We suggest that similar ORFeome projects will be valuable for other organisms, including humans.
Down-regulation of mitogenic signaling in mammalian cells relies in part on endosomal trafficking of activated receptors into lysosomes, where the receptors are degraded. These events are mediated by ubiquitination of the endosomal cargo and its consequent sorting into multivesicular bodies that form at the surfaces of late endosomes. Tumor susceptibility gene 101 (tsg101) recently was found to be centrally involved in this process. Here we report that TSG101 interacts with hepatocyte growth factor-regulated tyrosine kinase substrate (HRS), an early endosomal protein, and that disruption of this interaction impedes endosomal trafficking and endocytosis-mediated degradation of mitogenic receptors. TSG101͞HRS interaction occurs between a ubiquitin-binding domain of TSG101 and two distinct proline-rich regions of HRS, and is modulated by a C-terminal TSG101 sequence that resembles a motif targeted in HRS. Mutational perturbation of TSG101͞HRS interaction prevented delivery of epidermal growth factor receptor (EGFR) to late endosomes, resulted in the cellular accumulation of ubiquitinated EGFR in early endosomes, and inhibited ligandinduced down-regulation of EGFR. Our results reveal the TSG101 interaction with HRS as a crucial step in endocytic down-regulation of mitogenic signaling and suggest a role for this interaction in linking the functions of early and late endosomes.epidermal growth factor receptor ͉ endocytosis ͉ multivesicular body ͉ endosomes
The genome sequences of Caenorhabditis elegans, Drosophila melanogaster and Arabidopsis thaliana have been predicted to contain 19,000, 13,600 and 25,500 genes, respectively. Before this information can be fully used for evolutionary and functional studies, several issues need to be addressed. First, the gene number estimates obtained in silico and not yet supported by any experimental data need to be verified. For example, it seems biologically paradoxical that C. elegans would have 50% more genes than Drosophilia. Second, intron/exon predictions need to be tested experimentally. Third, complete sets of open reading frames (ORFs), or "ORFeomes," need to be cloned into various expression vectors. To address these issues simultaneously, we have designed and applied to C. elegans the following strategy. Predicted ORFs are amplified by PCR from a highly representative cDNA library using ORF-specific primers, cloned by Gateway recombination cloning and then sequenced to generate ORF sequence tags (OSTs) as a way to verify identity and splicing. In a sample (n=1,222) of the nearly 10,000 genes predicted ab initio (that is, for which no expressed sequence tag (EST) is available so far), at least 70% were verified by OSTs. We also observed that 27% of these experimentally confirmed genes have a structure different from that predicted by GeneFinder. We now have experimental evidence that supports the existence of at least 17,300 genes in C. elegans. Hence we suggest that gene counts based primarily on ESTs may underestimate the number of genes in human and in other organisms.
The ability to clone and manipulate DNA segments is central to molecular methods that enable expression, screening, and functional characterization of genes, proteins, and regulatory elements. We previously described the development of a novel technology that utilizes in vitro site-specific recombination to provide a robust and flexible platform for high-throughput cloning and transfer of DNA segments. By using an expanded repertoire of recombination sites with unique specificities, we have extended the technology to enable the high-efficiency in vitro assembly and concerted cloning of multiple DNA segments into a vector backbone in a predefined order, orientation, and reading frame. The efficiency and flexibility of this approach enables collections of functional elements to be generated and mixed in a combinatorial fashion for the parallel assembly of numerous multi-segment constructs. The assembled constructs can be further manipulated by directing exchange of defined segments with alternate DNA segments. In this report, we demonstrate feasibility of the technology and application to the generation of fusion proteins, the linkage of promoters to genes, and the assembly of multiple protein domains. The technology has broad implications for cell and protein engineering, the expression of multidomain proteins, and gene function analysis.[Supplemental material is available online at www.genome.org.]The cloning and manipulation of DNA segments, typically encoding functional elements such as promoters, genes, protein domains, or fusion tags, are central to methods of cell engineering, protein production, and gene-function analysis. The large number of available genome sequences now makes it possible to create and apply repositories of defined functional elements to conduct high-throughput, genome-wide analyses. The Gateway Cloning Technology (Hartley et al. 2000) uses in vitro sitespecific recombination to clone and subsequently transfer DNA segments between vector backbones. This approach has been used to generate several large clone collections (Entry Clones), in some cases comprising the entire or nearly entire coding capacity of model genomes as open reading frames (ORFs). These ORFeomes include Caenorhabditis elegans (Walhout et al. 2000b;Reboul et al. 2001Reboul et al. , 2003, Pseudomonas aeruginosa (LaBaer et al. 2004), and Saccharomyces cerevisiae (G. Marsischky, pers. comm.), Arabidopsis (Yamada et al. 2003; also see Atome project http:// genoplante-info.infobiogen.fr/Databases/CT_Nouveaux_Outils/ NO2001054/), human (clones available from several commercial sources), and an incipient collection of Drosophila ORFs (http:// www.fruitfly.org/EST/gateway.shtml). A collection of sequenced, full-length Arabidopsis cDNAs in the Gateway Vector pCMV-SPORT6 will shortly be made available through INRA-Genoscope (Castelli et al. 2004). Repositories of full-length clones, some of which are in the Gateway format, are available for Xenopus (http://xgc.nci.nih.gov/), zebrafish (http://zgc.nci.nih.gov/), as well as many hu...
DNA involved in the mobilization of broad-host-range plasmid R1162 was localized to a region of 2.7 kilobases within coordinates 3.4 to 6.1 kilobases on the R1162 map. By examining the transfer properties of plasmids containing cloned fragments of DNA from within this region, we showed that at least four trans-active products and a cis-active site (oriT) were involved in mobilization. A cloned DNA fragment of 155 base pairs was capable of providing full onrT activity. This fragment was located within 600 base pairs of DNA containing the origin of replication of R1162, and its nucleotide sequence and that of neighboring DNA were determined. Activation of oriT required R1162-encoded, trans-acting products. Deletions which resulted in the loss of one or more of these had a variable effect on transfer efficiency and indicated the presence of both essential and nonessential Mob products. Regions encoding these products flanked onrT and in one case appeared to overlap a gene essential for plasmid replication. The implications of these findings with respect to the broad host range of R1162 are discussed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.