Although common among bacteria, lateral gene transfer (the movement of genes between distantly related organisms) is thought to occur only rarely between bacteria and multicellular eukaryotes. However, the presence of endosymbionts, such as Wolbachia pipientis, within some eukaryotic germlines may facilitate bacterial gene transfers to eukaryotic host genomes. We therefore examined host genomes for evidence of gene transfer events from Wolbachia bacteria to their hosts. We found and confirmed transfers into the genomes of four insect and four nematode species that range from nearly the entire Wolbachia genome (>1 megabase) to short (<500 base pairs) insertions. Potential Wolbachia-to-host transfers were also detected computationally in three additional sequenced insect genomes. We also show that some of these inserted Wolbachia genes are transcribed within eukaryotic cells lacking endosymbionts. Therefore, heritable lateral gene transfer occurs into eukaryotic hosts from their prokaryote symbionts, potentially providing a mechanism for acquisition of new genes and functions.
The complete 108,845-nucleotide sequence of catabolic plasmid pADP-1 from Pseudomonas sp. strain ADP was determined. Plasmid pADP-1 was previously shown to encode AtzA, AtzB, and AtzC, which catalyze the sequential hydrolytic removal of s-triazine ring substituents from the herbicide atrazine to yield cyanuric acid. Computational analyses indicated that pADP-1 encodes 104 putative open reading frames (ORFs), which are predicted to function in catabolism, transposition, and plasmid maintenance, transfer, and replication. Regions encoding transfer and replication functions of pADP-1 had 80 to 100% amino acid sequence identity to pR751, an IncP plasmid previously isolated from Enterobacter aerogenes. pADP-1 was shown to contain a functional mercury resistance operon with 99% identity to Tn5053. Complete copies of transposases with 99% amino acid sequence identity to TnpA from IS1071 and TnpA from Pseudomonas pseudoalcaligenes were identified and flank each of the atzA, atzB, and atzC genes, forming structures resembling nested catabolic transposons. Functional analyses identified three new catabolic genes, atzD, atzE, and atzF, which participate in atrazine catabolism. Crude extracts from Escherichia coli expressing AtzD hydrolyzed cyanuric acid to biuret. AtzD showed 58% amino acid sequence identity to TrzD, a cyanuric acid amidohydrolase, from Pseudomonas sp. strain NRRLB-12227. Two other genes encoding the further catabolism of cyanuric acid, atzE and atzF, reside in a contiguous cluster adjacent to a potential LysR-type transcriptional regulator. E. coli strains bearing atzE and atzF were shown to encode a biuret hydrolase and allophanate hydrolase, respectively. atzDEF are cotranscribed. AtzE and AtzF are members of a common amidase protein family. These data reveal the complete structure of a catabolic plasmid and show that the atrazine catabolic genes are dispersed on three disparate regions of the plasmid. 
These results begin to provide insight into how plasmids are structured, and thus evolve, to encode the catabolism of compounds recently added to the biosphere.
Soil bacteria that also form mutualistic symbioses in plants encounter two major levels of selection. One occurs during adaptation to and survival in soil, and the other occurs in concert with host plant speciation and adaptation. Actinobacteria from the genus Frankia are facultative symbionts that form N2-fixing root nodules on diverse and globally distributed angiosperms in the "actinorhizal" symbioses. Three closely related clades of Frankia sp. strains are recognized; members of each clade infect a subset of plants from among eight angiosperm families. We sequenced the genomes from three strains; their sizes varied from 5.43 Mbp for a narrow host range strain (Frankia sp. strain HFPCcI3) to 7.50 Mbp for a medium host range strain (Frankia alni strain ACN14a) to 9.04 Mbp for a broad host range strain (Frankia sp. strain EAN1pec). This size divergence is the largest yet reported for such closely related soil bacteria (97.8%-98
Lack of complete chloroplast genome sequences is still one of the major limitations to extending chloroplast genetic engineering technology to useful crops. Therefore, we sequenced the soybean chloroplast genome and compared it to the other completely sequenced legumes, Lotus and Medicago. The chloroplast genome of Glycine is 152,218 base pairs (bp) in length, including a pair of inverted repeats of 25,574 bp of identical sequence separated by a small single copy region of 17,895 bp and a large single copy region of 83,175 bp. The genome contains 111 unique genes, and 19 of these are duplicated in the inverted repeat (IR). Comparisons of Glycine, Lotus and Medicago confirm the organization of legume chloroplast genomes based on previous studies. Gene content of the three legumes is nearly identical. The rpl22 gene is missing from all three legumes, and Medicago is missing rps16 and one copy of the IR. Gene order in Glycine, Lotus, and Medicago differs from the usual gene order for angiosperm chloroplast genomes by the presence of a single, large inversion of 51 kilobases (kb). Detailed analyses of repeated sequences indicate that many of the Glycine repeats that are located in the intergenic spacer regions and introns occur in the same location in the other legumes and in Arabidopsis, suggesting that they may play some functional role. The presence of small repeats of psbA and rbcL in legumes that have lost one copy of the IR indicates that this loss has only occurred once during the evolutionary history of legumes.
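The region lengths reported for the Glycine chloroplast genome can be checked against the stated total. The sketch below is only an arithmetic illustration: it assumes the standard quadripartite layout (two inverted-repeat copies, one small and one large single copy region), and the function and constant names are hypothetical, not taken from the study.

```python
# Region lengths in bp, copied from the abstract above.
INVERTED_REPEAT_BP = 25_574      # one copy of the inverted repeat (IR)
SMALL_SINGLE_COPY_BP = 17_895    # small single copy (SSC) region
LARGE_SINGLE_COPY_BP = 83_175    # large single copy (LSC) region
REPORTED_TOTAL_BP = 152_218      # genome length stated in the abstract


def quadripartite_length(ir_bp: int, ssc_bp: int, lsc_bp: int) -> int:
    """Length of a circular genome made of two IR copies, one SSC and one LSC."""
    return 2 * ir_bp + ssc_bp + lsc_bp


total_length = quadripartite_length(
    INVERTED_REPEAT_BP, SMALL_SINGLE_COPY_BP, LARGE_SINGLE_COPY_BP
)

# The four regions should account for every base of the reported genome.
print(total_length, total_length == REPORTED_TOTAL_BP)
```

Under that assumption the regions sum to the reported 152,218 bp, so the figures in the abstract are internally consistent.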
Despite the agricultural importance of both potato and tomato, very little is known about their chloroplast genomes. Analysis of the complete sequences of tomato, potato, tobacco, and Atropa chloroplast genomes reveals significant insertions and deletions within certain coding regions or regulatory sequences (e.g., deletion of repeated sequences within 16S rRNA, ycf2 or ribosomal binding sites in ycf2). RNA, photosynthesis, and ATP synthase genes are the least divergent, whereas clpP, cemA, ccsA, and matK are the most divergent. Repeat analyses identified 33-45 direct and inverted repeats ≥30 bp with a sequence identity of at least 90%; all but five of the repeats shared by all four Solanaceae genomes are located in the same genes or intergenic regions, suggesting a functional role. A comprehensive genome-wide analysis of all coding sequences and intergenic spacer regions was done for the first time in chloroplast genomes. Only four spacer regions are fully conserved (100% sequence identity) among all genomes; deletions or insertions within some intergenic spacer regions result in less than 25% sequence identity, underscoring the importance of choosing appropriate intergenic spacers for plastid transformation and providing valuable new information for phylogenetic utility of the chloroplast intergenic spacer regions. Comparison of coding sequences with expressed sequence tags showed a considerable amount of variation, resulting in amino acid changes; none of the C-to-U conversions observed in potato and tomato were conserved in tobacco and Atropa. It is possible that there has been a loss of conserved editing sites in potato and tomato.
Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Ehrhartoideae and Pooideae. Repeat analysis identified 19-37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16-21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C-to-U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Ehrhartoideae and Pooideae.
Fine mapping and positional cloning will eventually improve with the anchoring of additional markers derived from genomic clones such as BACs. From 2,603 new BAC-end genomic sequences from Gossypium hirsutum Acala 'Maxxa', 1,316 PCR primer pairs (designated as MUSB) were designed to flank microsatellite or simple sequence repeat motif sequences. Most (1,164 or 88%) MUSB primer pairs successfully amplified DNA from three species of cotton with an average of three amplicons per marker, and 365 markers (21%) were polymorphic between G. hirsutum and G. barbadense. An interspecific RIL population developed from the above two entries was used to map 433 marker loci onto 46 linkage groups with a genetic distance of 2,126.3 cM covering approximately 45% of the cotton genome and an average distance between two loci of 4.9 cM. Based on genome-specific chromosomes identified in tetraploid G. hirsutum (A and D subgenomes), 56.9% of the coverage was located on the A subgenome while 39.7% was assigned to the D subgenome in the genetic map, suggesting that the A subgenome may be more polymorphic and recombinationally active than originally thought. The linkage groups were assigned to 23 of the 26 chromosomes. This is the first genetic map in which the linkage groups A01 and A02/D03 have been assigned to specific chromosomes. In addition, the MUSB markers derived from BAC-end sequences allow fine genetic and QTL mapping of important traits and, for the first time, provide reconciliation of the genetic and physical maps. Limited QTL analyses suggested that loci on chromosomes 2, 3, 12, 15 and 18 may affect variation in fiber quality traits. The original BAC clones containing the newly mapped MUSB that tag the QTLs provide critical DNA regions for the discovery of gene sequences involved in biological processes such as fiber development and pest resistance in cotton.
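The summary statistics quoted for the cotton map follow from simple ratios, sketched below. The abstract does not say how the 4.9 cM average was computed; dividing total map length by the number of loci reproduces it, so that is the assumption made here. The per-interval figure is an alternative convention shown only for comparison, and all variable names are hypothetical.

```python
# Values copied from the abstract above.
MAP_LENGTH_CM = 2126.3          # total genetic distance of the map
MAPPED_LOCI = 433               # marker loci placed on the map
LINKAGE_GROUPS = 46             # linkage groups in the map
PRIMER_PAIRS_DESIGNED = 1316    # MUSB primer pairs designed
PRIMER_PAIRS_AMPLIFIED = 1164   # primer pairs that amplified DNA

# Assumed convention: average spacing = total map length / number of loci.
mean_spacing_per_locus = MAP_LENGTH_CM / MAPPED_LOCI

# Alternative convention (not stated in the abstract): each linkage group
# with n loci has n - 1 intervals, so the map has loci - groups intervals.
interval_count = MAPPED_LOCI - LINKAGE_GROUPS
mean_spacing_per_interval = MAP_LENGTH_CM / interval_count

# Fraction of designed primer pairs that amplified successfully.
amplification_rate = PRIMER_PAIRS_AMPLIFIED / PRIMER_PAIRS_DESIGNED

print(f"cM per locus:    {mean_spacing_per_locus:.1f}")
print(f"cM per interval: {mean_spacing_per_interval:.1f}")
print(f"amplified:       {amplification_rate:.0%}")
```

The per-locus ratio rounds to the reported 4.9 cM and the amplification rate to the reported 88%; the per-interval spacing is somewhat larger because the first locus in each linkage group contributes no interval.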