The superfamily of G-protein-coupled receptors (GPCRs) is very diverse in structure and function and its members are among the most pursued targets for drug development. We identified more than 800 human GPCR sequences and simultaneously analyzed 342 unique functional nonolfactory human GPCR sequences with phylogenetic analyses. Our results show, with high bootstrap support, five main families, named glutamate, rhodopsin, adhesion, frizzled/taste2, and secretin, forming the GRAFS classification system. The rhodopsin family is the largest and forms four main groups with 13 subbranches. Positions of the GPCRs in chromosomal paralogons regions indicate the importance of tetraploidizations or local gene duplication events for their creation. We also searched for "fingerprint" motifs using Hidden Markov Models delineating the putative inter-relationship of the GRAFS families. We show several common structural features indicating that the human GPCRs in the GRAFS families share a common ancestor. This study represents the first overall map of the GPCRs in a single mammalian genome. Our novel approach of analyzing such large and diverse sequence sets may be useful for studies on GPCRs in other genomes and divergent protein families.
Many chromosome regions in the human genome exist in four similar copies, suggesting that the entire genome was duplicated twice in early vertebrate evolution, a concept called the 2R hypothesis. Forty-two gene families on the four Hox-bearing chromosomes were recently analyzed by others, and 32 of these were reported to have evolutionary histories incompatible with duplications concomitant with the Hox clusters, thereby contradicting the 2R hypothesis. However, we show here that nine of the families have probably been translocated to the Hox-bearing chromosomes more recently, and that three of these belong to other chromosome quartets where they actually support the 2R hypothesis. We consider 13 families too complex to shed light on the chromosome duplication hypothesis. Among the remaining 20 families, 14 display phylogenies that support or are at least consistent with the Hox-cluster duplications. Only six families seem to have other phylogenies, but these trees are highly uncertain due to shortage of sequence information. We conclude that all relevant and analyzable families support or are consistent with block/chromosome duplications and that none clearly contradicts the 2R hypothesis.
BackgroundOne of the many gene families that expanded in early vertebrate evolution is the neuropeptide (NPY) receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes) and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10.ResultsChromosome regions with conserved synteny were identified and confirmed by phylogenetic analyses in H. sapiens, M. musculus, D. rerio, T. rubripes and T. nigroviridis. 26 gene families, including the NPY receptor genes, (plus 3 described recently by other labs) showed a tree topology consistent with duplications in early vertebrate evolution and in the actinopterygian lineage, thereby supporting expansion through block duplications. Eight gene families had complications that precluded analysis (such as short sequence length or variable number of repeated domains) and another eight families did not support block duplications (because the paralogs in these families seem to have originated in another time window than the proposed genome duplication events). RT-PCR carried out with several tissues in T. rubripes revealed that all five NPY receptors were expressed in the brain and subtypes Y2, Y4 and Y8 were also expressed in peripheral organs.ConclusionWe conclude that the phylogenetic analyses and chromosomal locations of these gene families support duplications of large blocks of genes or even entire chromosomes. Thus, these results are consistent with two early vertebrate tetraploidizations forming a paralogon comprising human chromosomes 4, 5, 8 and 10 and one teleost tetraploidization. The combination of positional and phylogenetic data further strengthens the identification of orthologs and paralogs in the NPY receptor family.
We have isolated a neurotrophin from the lamprey that permitted us to perform a phylogenetic analysis of the neurotrophin gene family that dates back more than 460 million years to the early vertebrate ancestors. The results show that the neurotrophin gene family was originally formed by two subsequent duplications. The duplication that formed nerve growth factor, neurotrophin-3, brain-derived neurotrophic factor, and neurotrophin-4/5 occurred after the split of lampreys but before the split of cartilaginous fish from the main vertebrate lineage. Compilation of chromosomal gene maps around the neurotrophins shows that they are located in paralogous regions, suggesting that the genes were formed at major duplication events possibly by complete genome doubling. Analysis of two isolated Trk receptor sequences shows similar results as for the lamprey neurotrophin. Multiple neurotrophin and Trk genes, including neurotrophin-6 and -7, have been found in bony fish, and we suggest that the extra genes were formed by an additional duplication in the bony fish lineage. Analysis of lamprey Trk mRNA expression in the adult brain shows that the genes are expressed in all regions analyzed so far. Together, the results suggest that the duplications of ancestral neurotrophin and Trk genes at an early vertebrate stage have permitted evolution to bring about differential neurotrophin and Trk expression, thereby allowing the formation of specific functions in selective neuronal populations.
Neuropeptide Y (NPY) receptors mediate a variety of physiological responses including feeding and vasoconstriction. To investigate the evolutionary events that have generated this receptor family, we have sequenced and determined the chromosomal localizations of all five presently known mammalian NPY receptor subtype genes in the domestic pig, Sus scrofa (SSC). The orthologs of the Y 1 and Y 2 subtypes display high amino acid sequence identities between pig, human, and mouse (92%-94%), whereas the Y 4 , Y 5 , and y 6 subtypes display lower identities (76%-87%). The lower identity of Y 5 is due to high sequence divergence in the large third intracellular loop. The NPY1R, NPY2R, and NPY5R receptor genes were localized to SSC8, the NPY4R to SSC14, and NPY6R to SSC2. Our comparisons strongly suggest that the tight cluster of NPY1R, NPY2R, and NPY5R on human chromosome 4 (HSA4) represents the ancestral configuration, whereas the porcine cluster has been split by two inversions on SSC8. These 3 genes, along with adjacent genes from 14 other gene families, form a cluster on HSA4 with extensive similarities to a cluster on HSA5, where NPY6R and >13 other paralogs reside, as well as another large cluster on HSA10 that includes NPY4R. Thus, these gene families have expanded through large-scale duplications. The sequence comparisons show that the NPY receptor triplet NPY1R-NPY2R-NPY5R existed before these large-scale duplications.
The aims of the study were to outline the sequence of events that gave rise to the vertebrate insulin-relaxin gene family and the chromosomal regions in which they reside. We analyzed the gene content surrounding the human insulin/relaxin genes with respect to what family they belonged to and if the duplication history of investigated families parallels the evolution of the insulin-relaxin family members. Markov Clustering and phylogenetic analysis were used to determine family identity. More than 15% of the genes belonged to families that have paralogs in the regions, defining two sets of quadruplicate paralogy regions. Thereby, the localization of insulin/relaxin genes in humans is in accordance with those regions on human chromosomes 1, 11, 12, 19q (insulin/insulin-like growth factors) and 1, 6p/15q, 9/5, 19p (insulin-like factors/relaxins) were formed during two genome duplications. We compared the human genome with that of Ciona intestinalis, a species that split from the vertebrate lineage before the two suggested genome duplications. Two insulin-like orthologs were discovered in addition to the already described Ci-insulin gene. Conserved synteny between the Ciona regions hosting the insulin-like genes and the two sets of human paralogons implies their common origin. Linkage of the two human paralogons, as seen in human chromosome 1, as well as the two regions hosting the Ciona insulin-like genes suggests that a segmental duplication gave rise to the region prior to the genome doublings. Thus, preserved gene content provides support that genome duplication(s) in addition to segmental and single-gene duplications shaped the genomes of extant vertebrates.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.