As a base for human transcriptome and functional genomics, we created the "full-length long Japan" (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at ∼58% compared with a peak at ∼42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at ∼42%, relatively low compared with that of protein-coding cDNAs.
The human genome is thought to harbor 50,000 to 100,000 genes, of which about half have been sampled to date in the form of expressed sequence tags. An international consortium was organized to develop and map gene-based sequence tagged site markers on a set of two radiation hybrid panels and a yeast artificial chromosome library. More than 16,000 human genes have been mapped relative to a framework map that contains about 1000 polymorphic genetic markers. The gene map unifies the existing genetic and physical maps with the nucleotide and protein sequence databases in a fashion that should speed the discovery of genes underlying inherited human disease. The integrated resource is available through a site on the World Wide Web at http://www.ncbi.nlm.nih.gov/SCIENCE96/.
A novel v-erb-B-related gene, c-erb-B-2, which has been identified in the human genome, maps to human chromosome 17 at q21 (ref. 40), and seems to encode a polypeptide with a kinase domain that is highly homologous with, but distinct from, that of the epidermal growth factor (EGF) receptor. The c-erb-B-2 gene is conserved in vertebrates and it has been suggested that the neu gene, detected in a series of rat neuro/glioblastomas, is, in fact, the rat c-erb-B-2 gene. Amplification of the c-erb-B-2 gene in a salivary adenocarcinoma and a gastric cancer cell line MKN-7 suggests that its over-expression is sometimes involved in the neoplastic process. To determine the nature of the c-erb-B-2 protein, we have now molecularly cloned complementary DNA for c-erb-B-2 messenger RNA prepared from MKN-7 cells. Its sequence shows that the c-erb-B-2 gene encodes a possible receptor protein and allows an analysis of the similarity of the protein to the EGF receptor and the neu product. As a consequence of chromosomal aberration in MKN-7 cells, a 4.6-kilobase (kb) normal transcript and a truncated 2.3-kb transcript of c-erb-B-2 are synthesized at elevated levels. The latter transcript presumably encodes only the extracellular domain of the putative receptor.
The small guanosine triphosphatases (GTPases) Cdc42 and Rac1 regulate E-cadherin-mediated cell-cell adhesion. IQGAP1, a target of Cdc42 and Rac1, was localized with E-cadherin and beta-catenin at sites of cell-cell contact in mouse L fibroblasts expressing E-cadherin (EL cells), and interacted with E-cadherin and beta-catenin both in vivo and in vitro. IQGAP1 induced the dissociation of alpha-catenin from a cadherin-catenin complex in vitro and in vivo. Overexpression of IQGAP1 in EL cells, but not in L cells expressing an E-cadherin-alpha-catenin chimeric protein, resulted in a decrease in E-cadherin-mediated cell-cell adhesive activity. Thus, IQGAP1, acting downstream of Cdc42 and Rac1, appears to regulate cell-cell adhesion through the cadherin-catenin pathway.
A map of 30,181 human gene-based markers was assembled and integrated with the current genetic map by radiation hybrid mapping. The new gene map contains nearly twice as many genes as the previous release, includes most genes that encode proteins of known function, and is twofold to threefold more accurate than the previous version. A redesigned, more informative and functional World Wide Web site (www.ncbi.nlm.nih.gov/genemap) provides the mapping information and associated data and annotations. This resource constitutes an important infrastructure and tool for the study of complex genetic traits, the positional cloning of disease genes, the cross-referencing of mammalian genomes, and validated human transcribed sequences for large-scale studies of gene expression.
Cdc42 and Rac1 have been implicated in the regulation of various cell functions such as cell morphology, polarity, and cell proliferation. We have partially purified a Cdc42- and Rac1-associated protein with molecular mass of about 170 kDa (p170) from bovine brain cytosol. This protein interacted with guanosine 5'-(3-O-thio)triphosphate (GTPgammaS).glutathione S-transferase (GST)-Cdc42 and GTPgammaS++.GST-Rac1 but not with the GDP.GST-Cdc42, GDP.GST-Rac1, or GTPgammaS.GST-RhoA). We identified p170 as an IQGAP, which is originally identified as a putative Ras GTPase-activating protein. Recombinant IQGAP specifically interacted with GTPgammaS.Cdc42 and GTPgammaS.Rac1. The C-terminal fragment of IQGAP was responsible for their interactions. IQGAP was specifically immunoprecipitated with dominant-active Cdc42(Val12) or Rac1(Val12) from the COS7 cells expressing Cdc42(Val12) or Rac1(Val12), respectively. Immunofluorescence analysis revealed that IQGAP was accumulated at insulin- or Rac1-induced membrane ruffling areas. This accumulation of IQGAP was blocked by the microinjection of the dominant-negative Rac1(Asn17) or Cdc42(Asn17). Moreover, IQGAP was accumulated at the cell-cell junction in MDCK cells, where alpha-catenin and ZO-1 were localized. These results suggest that IQGAP is a novel target molecule for Cdc42 and Rac1.
The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.
cDNA clones of ski and the ski-related gene, sno, were obtained by screening human cDNA libraries. The predicted open reading frame of h-ski could encode a protein of 728 amino acid residues. The h-ski protein is highly homologous with the v-ski protein. The overall homology between h-ski and v-ski is 91% at the amino acid level. DNA sequencing analysis revealed two types of cDNA clones from the sno (ski-related novel gene) gene, possibly due to alternative splicing. The first type, named snoN (non Alu-containing), encoded a protein of 684 amino acid residues. The second type, named snoA (Alu-containing), encoded a protein of 415 amino acid residues. The first 366 amino acid residues of snoN and snoA are the same, but subsequent amino acids show divergence. Several transcripts of h-ski (6.0, 4.7, 3.8, 3.0, 2.1 and 1.8kb) were detected. The mRNAs of h-sno were 6.2, 4.4 and 3.2kb.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.