Genomes contain a large number of genes that do not have recognizable homologues in other species and that are likely to be involved in important species-specific adaptive processes. The origin of many such "orphan" genes remains unknown. Here we present the first systematic study of the characteristics and mechanisms of formation of primate-specific orphan genes. We determine that codon usage values for most orphan genes fall within the bulk of the codon usage distribution of bona fide human proteins, supporting their current protein-coding annotation. We also show that primate orphan genes display distinctive features in relation to genes of wider phylogenetic distribution: higher tissue specificity, more rapid evolution, and shorter peptide size. We estimate that around 24% are highly divergent members of mammalian protein families. Interestingly, around 53% of the orphan genes contain sequences derived from transposable elements (TEs) and are mostly located in primate-specific genomic regions. This indicates frequent recruitment of TEs as part of novel genes. Finally, we also obtain evidence that a small fraction of primate orphan genes, around 5.5%, might have originated de novo from mammalian noncoding genomic regions.
Charcot-Marie-Tooth disease type 1A (CMT1A) is associated with a DNA duplication at chromosome 17p11.2. In view of the point mutation in the gene for peripheral myelin protein pmp-22/gas-3 in Trembler mice, a murine model for CMT1A, we have analysed whether this gene is altered in CMT1A. Here we show that the human homologue of the murine pmp-22 gene is located within the CMT1A DNA duplication, which is a direct repeat and does not interrupt the coding region of PMP-22. Expression of PMP-22 in CMT1A fibroblasts is similar to expression in control fibroblasts. Increased gene dosage or altered PMP-22 expression in the peripheral nervous system are therefore possible mechanisms by which PMP-22 is involved in CMT1A.
A genetic linkage map of the tetraploid water yam ( Dioscorea alata L.) genome was constructed based on 469 co-dominantly scored amplified fragment length polymorphism (AFLP) markers segregating in an intraspecific F(1) cross. The F(1) was obtained by crossing two improved breeding lines, TDa 95/00328 as female parent and TDa 87/01091 as male parent. Since the mapping population was an F(1) cross between presumed heterozygous parents, marker segregation data from both parents were initially split into maternal and paternal data sets, and separate genetic linkage maps were constructed. Later, data analysis showed that this was not necessary and thus the combined markers from both parents were used to construct a genetic linkage map. The 469 markers were mapped on 20 linkage groups with a total map length of 1,233 cM and a mean marker spacing of 2.62 cM. The markers segregated like a diploid cross-pollinator population suggesting that the water yam genome is allo-tetraploid (2n = 4 x = 40). QTL mapping revealed one AFLP marker E-14/M52-307 located on linkage group 2 that was associated with anthracnose resistance, explaining 10% of the total phenotypic variance. This map covers 65% of the yam genome and is the first linkage map reported for D. alata. The map provides a tool for further genetic analysis of traits of agronomic importance and for using marker-assisted selection in D. alata breeding programmes. QTL mapping opens new avenues for accumulating anthracnose resistance genes in preferred D. alata cultivars.
A genetic linkage map of the tetraploid white yam ( Dioscorea rotundata Poir.) was constructed based on 341 co-dominantly scored amplified fragment length polymorphism (AFLP) markers segregating in an intraspecific F(1) cross. The F(1) mapping population was produced by crossing a landrace cultivar TDr 93-1 as female parent to a breeding line TDr 87/00211 as the male parent. The marker segregation data were split into maternal and paternal data sets, and separate genetic linkage maps were constructed since the mapping population was an F(1) cross between two presumed heterozygous parents. The markers segregated like a diploid cross-pollinator population suggesting that the D. rotundata genome is an allo-tetraploid (2n = 4 x = 40). The maternal map comprised 155 markers mapped on 12 linkage groups with a total map length of 891 cM. Three linkage groups consisted of maternal parent markers only. The paternal map consisted of 157 markers mapped on 13 linkage groups with a total map length of 852 cM. Three and one quantitative trait loci (QTLs) with effects on resistance to Yam Mosaic Virus (YMV) were identified on the maternal and paternal linkage maps, respectively. Prospects for detecting more QTLs and using marker-assisted selection in white yam breeding appear good, but this is subject to the identification of additional molecular markers to cover more of the genome.
For a long time, the complement system in mammals has been regarded as a biological system that plays an essential role in innate immunity. More recently, it has been recognized that the complement system contributes heavily to the generation and development of an acquired immune response. In fact, this ancient mechanism of defense has evolved from a primitive mechanism of innate immune recognition in invertebrate species to that of an effector system that bridges the innate with the adaptive immune response in vertebrate species. When and how did complement evolve into a shared effector system between innate and adaptive immunity? To answer this question, our group is interested in understanding the role of complement in innate and adaptive immune responses in an evolutionary relevant species: the teleost fish. The attractiveness of this species as an animal model is based on two important facts. First, teleost fish are one of the oldest animal species to have developed an adaptive immune response. Second, the complement system of teleost fish offers a unique feature, which is the structural and functional diversity of its main effector protein, C3, the third component of the complement system.
BackgroundThe human chromosome 8p23.1 region contains a 3.8–4.5 Mb segment which can be found in different orientations (defined as genomic inversion) among individuals. The identification of single nucleotide polymorphisms (SNPs) tightly linked to the genomic orientation of a given region should be useful to indirectly evaluate the genotypes of large genomic orientations in the individuals.ResultsWe have identified 16 SNPs, which are in linkage disequilibrium (LD) with the 8p23.1 inversion as detected by fluorescent in situ hybridization (FISH). The variability of the 8p23.1 orientation in 150 HapMap samples was predicted using this set of SNPs and was verified by FISH in a subset of samples. Four genes (NEIL2, MSRA, CTSB and BLK) were found differentially expressed (p<0.0005) according to the orientation of the 8p23.1 region. Finally, we have found variable levels of mosaicism for the orientation of the 8p23.1 as determined by FISH.ConclusionBy means of dense SNP genotyping of the region, haplotype-based computational analyses and FISH experiments we could infer and verify the orientation status of alleles in the 8p23.1 region by detecting two short haplotype stretches at both ends of the inverted region, which are likely the relic of the chromosome in which the original inversion occurred. Moreover, an impact of 8p23.1 inversion on gene expression levels cannot be ruled out, since four genes from this region have statistically significant different expression levels depending on the inversion status. FISH results in lymphoblastoid cell lines suggest the presence of mosaicism regarding the 8p23.1 inversion.
Genomic plasticity of human chromosome 8p23.1 region is highly influenced by two groups of complex segmental duplications (SDs), termed REPD and REPP, that mediate different kinds of rearrangements. Part of the difficulty to explain the wide range of phenotypes associated with 8p23.1 rearrangements is that REPP and REPD are not yet well characterized, probably due to their polymorphic status. Here, we describe a novel primate-specific gene family, named FAM90A (family with sequence similarity 90), found within these SDs. According to the current human reference sequence assembly, the FAM90A family includes 24 members along 8p23.1 region plus a single member on chromosome 12p13.31, showing copy number variation (CNV) between individuals. These genes can be classified into subfamilies I and II, which differ in their upstream and 5'-untranslated region sequences, but both share the same open reading frame and are ubiquitously expressed. Sequence analysis and comparative fluorescence in situ hybridization studies showed that FAM90A subfamily II suffered a big expansion in the hominoid lineage, whereas subfamily I members were likely generated sometime around the divergence of orangutan and African great apes by a fusion process. In addition, the analysis of the Ka/Ks ratios provides evidence of functional constraint of some FAM90A genes in all species. The characterization of the FAM90A gene family contributes to a better understanding of the structural polymorphism of the human 8p23.1 region and constitutes a good example of how SDs, CNVs and rearrangements within themselves can promote the formation of new gene sequences with potential functional consequences.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.