Differences in gene expression may play a major role in speciation and phenotypic diversity. We examined genome-wide differences in transcription factor (TF) binding in several humans and a single chimpanzee using chromatin immunoprecipitation followed by sequencing (ChIP-Seq). The binding sites of RNA Polymerase II (PolII) and a key regulator of immune responses, NFκB (p65), were mapped in ten lymphoblastoid cell lines and 25% and 7.5% of the respective binding regions were found to differ between individuals. Binding differences were frequently associated with SNPs and genomic structural variants (SVs) and were often correlated with differences in gene expression, suggesting functional consequences of binding variation. Furthermore, comparing PolII binding between human and chimpanzee suggests extensive divergence in TF binding. Our results indicate that many differences in individuals and species occur at the level of TF binding and provide insight into the genetic events responsible for these differences. TextDifferences in gene expression have been observed in a variety of species (1-3). However, the extent to which TF binding differences occur both within individuals and closely related species and the global relationship between TF binding and genetic variation are largely unexplored (4). We used ChIP-Seq to map NFκB and PolII binding sites in ten humans: five are of European ancestry (including a parent-offspring trio), two of eastern Asian ancestry, and three of Nigerian
Differences in gene expression may play a major role in speciation and phenotypic diversity. We examined genome-wide differences in transcription factor (TF) binding in several humans and a single chimpanzee by using chromatin immunoprecipitation followed by sequencing. The binding sites of RNA polymerase II (PolII) and a key regulator of immune responses, nuclear factor kB (p65), were mapped in 10 lymphoblastoid cell lines, and 25 and 7.5% of the respective binding regions were found to differ between individuals. Binding differences were frequently associated with single-nucleotide polymorphisms and genomic structural variants, and these differences were often correlated with differences in gene expression, suggesting functional consequences of binding variation. Furthermore, comparing PolII binding between humans and chimpanzee suggests extensive divergence in TF binding. Our results indicate that many differences in individuals and species occur at the level of TF binding, and they provide insight into the genetic events responsible for these differences.
Apomixis, asexual reproduction through seed, enables breeders to identify and faithfully propagate superior heterozygous genotypes by seed without the disadvantages of vegetative propagation or the expense and complexity of hybrid seed production. The availability of new tools such as genotyping by sequencing and bioinformatics pipelines for species lacking reference genomes now makes the construction of dense maps possible in apomictic species, despite complications including polyploidy, multisomic inheritance, self-incompatibility, and high levels of heterozygosity. In this study, we developed saturated linkage maps for the maternal and paternal genomes of an interspecific Brachiaria ruziziensis (R. Germ. and C. M. Evrard) × B. decumbens Stapf. F1 mapping population in order to identify markers linked to apomixis. High-resolution molecular karyotyping and comparative genomics with Setaria italica (L.) P. Beauv provided conclusive evidence for segmental allopolyploidy in B. decumbens, with strong preferential pairing of homologs across the genome and multisomic segregation relatively more common in chromosome 8. The apospory-specific genomic region (ASGR) was mapped to a region of reduced recombination on B. decumbens chromosome 5. The Pennisetum squamulatum (L.) R.Br. PsASGR-BABY BOOM-like (psASGR–BBML)-specific primer pair p779/p780 was in perfect linkage with the ASGR in the F1 mapping population and diagnostic for reproductive mode in a diversity panel of known sexual and apomict Brachiaria (Trin.) Griseb. and P. maximum Jacq. germplasm accessions and cultivars. These findings indicate that ASGR–BBML gene sequences are highly conserved across the Paniceae and add further support for the postulation of the ASGR–BBML as candidate genes for the apomictic function of parthenogenesis.
Summary The rice pathogens Xanthomonas oryzae pathovar (pv.) oryzae and pv. oryzicola produce numerous transcription activator-like (TAL) effectors that increase bacterial virulence by activating expression of host susceptibility genes. Rice resistance mechanisms against TAL effectors include polymorphisms that prevent effector binding to susceptibility gene promoters, or that allow effector activation of resistance genes. This study identifies, in the heirloom variety Carolina Gold Select, a third mechanism of rice resistance involving TAL effectors. This resistance manifests through strong suppression of disease development in response to diverse TAL effectors from both X. oryzae pathovars. The resistance can be triggered by an effector with only 3.5 central repeats, is independent of the composition of the repeat variable diresidues that determine TAL effector binding specificity, and is independent of the transcriptional activation domain. We determined that the resistance is conferred by a single dominant locus, designated Xo1, that maps to a 1.09 Mbp fragment on chromosome 4. The Xo1 interval also confers complete resistance to the strains in the African clade of X. oryzae pv. oryzicola, representing the first dominant resistance locus against bacterial leaf streak in rice. The strong phenotypic similarity between the TAL effector triggered resistance conferred by Xo1 and that conferred by the tomato resistance gene Bs4 suggests that monocots and dicots share an ancient or convergently evolved mechanism to recognize analogous TAL effector epitopes.
Low-coverage next-generation sequencing methodologies are routinely employed to genotype large populations. Missing data in these populations manifest both as missing markers and markers with incomplete allele recovery. False homozygous calls at heterozygous sites resulting from incomplete allele recovery confound many existing imputation algorithms. These types of systematic errors can be minimized by incorporating depth-of-sequencing read coverage into the imputation algorithm. Accordingly, we developed Low-Coverage Biallelic Impute (LB-Impute) to resolve missing data issues. LB-Impute uses a hidden Markov model that incorporates marker read coverage to determine variable emission probabilities. Robust, highly accurate imputation results were reliably obtained with LB-Impute, even at extremely low (,13) average per-marker coverage. This finding will have implications for the design of genotype imputation algorithms in the future. LB-Impute is publicly available on GitHub at https://github.com/dellaportalaboratory/LB-Impute.KEYWORDS hidden Markov models; imputation; next-generation sequencing; population genetics; plant genomics T HE imputation of missing genotype data has been a key research topic in statistical genetics since well before the advent of next-generation sequencing (NGS) technologies. The goal of many of these algorithms was to reconstruct haplotypes from Sanger or microarray-based genotyping, usually on human populations. Strategies employing the expectation-maximization algorithm (Hawley and Kidd 1995;Long et al. 1995;Qin et al. 2002;Scheet and Stephens 2006), Bayesian inference Stephens and Donnelly 2003), or Markovian methodology (Stephens et al. 2001;Broman et al. 2003;Broman and Sen 2009), local ancestry and gametic phase, could be used to resolve missing markers within a population (Browning and Browning 2011). In these cases, missing genotypes were assigned based on the most likely proximal haplotypes. These computational methods greatly increased the informative content of genotyping information, especially for population studies (Spencer et al. 2009;Cleveland et al. 2011). While these programs were powerful and accurate, they also could be computationally expensive. Further, they assumed that available genotypes were largely correct, which could cause issues with sequencing data sets.The development of programs that focused primarily on the imputation of missing data and haplotype phasing was likely motivated by several factors. Genome-wide association studies could be enhanced by the inference of additional markers using large multipopulation data sets such as the International HapMap Project (International HapMap Consortium et al. 2010). The emergence of the meta-analysis led to a need for algorithms that could merge disparate data sets Howie et al. 2009;Li et al. 2010;Liu et al. 2013;Fuchsberger et al. 2015). These algorithms often employed large haplotype reference panels to improve imputation (Marchini et al. 2007;Browning and Browning 2009;Howie et al. 2009). In bialleli...
The widely recounted story of the origin of cultivated strawberry (Fragaria × ananassa) oversimplifies the complex interspecific hybrid ancestry of the highly admixed populations from which heirloom and modern cultivars have emerged. To develop deeper insights into the three century long domestication history of strawberry, we reconstructed the genealogy as deeply as possible—pedigree records were assembled for 8,851 individuals, including 2,656 cultivars developed since 1775. The parents of individuals with unverified or missing pedigree records were accurately identified by applying exclusion analysis to array-genotyped single nucleotide polymorphisms. We identified 187 wild octoploid and 1,171 F. × ananassa founders in the genealogy, from the earliest hybrids to modern cultivars. The pedigree networks for cultivated strawberry are exceedingly complex labyrinths of ancestral interconnections formed by diverse hybrid ancestry, directional selection, migration, admixture, bottlenecks, overlapping generations, and recurrent hybridization with common ancestors that have unequally contributed allelic diversity to heirloom and modern cultivars. Fifteen to 333 ancestors were predicted to have transmitted 90% of the alleles found in country-, region-, and continent-specific populations. Using parent-offspring edges in the global pedigree network, we found that selection cycle lengths over the last 200 years of breeding have been extraordinarily long (16.0-16.9 years/generation) but decreased to a present-day range of 6.0-10.0 years/generation. Our analyses uncovered conspicuous differences in the ancestry and structure of North American and European populations and shed light on forces that have shaped phenotypic diversity in F. × ananassa.
BackgroundMany areas critical to agricultural production and research, such as the breeding and trait mapping in plants and livestock, require robust and scalable genotyping platforms. Genotyping-by-sequencing (GBS) is a one such method highly suited to non-human organisms. In the GBS protocol, genomic DNA is fractionated via restriction digest, then reduced representation is achieved through size selection. Since many restriction sites are conserved across a species, the sequenced portion of the genome is highly consistent within a population. This makes the GBS protocol highly suited for experiments that require surveying large numbers of markers within a population, such as those involving genetic mapping, breeding, and population genomics. We have modified the GBS technology in a number of ways. Custom, enzyme specific adaptors have been replaced with standard Illumina adaptors compatible with blunt-end restriction enzymes. Multiplexing is achieved through a dual barcoding system, and bead-based library preparation protocols allows for in-solution size selection and eliminates the need for columns and gels.ResultsA panel of eight restriction enzymes was selected for testing on B73 maize and Nipponbare rice genomic DNA. Quality of the data was demonstrated by identifying that the vast majority of reads from each enzyme aligned to restriction sites predicted in silico. The link between enzyme parameters and experimental outcome was demonstrated by showing that the sequenced portion of the genome was adaptable by selecting enzymes based on motif length, complexity, and methylation sensitivity. The utility of the new GBS protocol was demonstrated by correctly mapping several in a maize F2 population resulting from a B73 × Country Gentleman test cross.ConclusionsThis technology is readily adaptable to different genomes, highly amenable to multiplexing and compatible with over forty commercially available restriction enzymes. These advancements represent a major improvement in genotyping technology by providing a highly flexible and scalable GBS that is readily implemented for studies on genome-wide variation.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-979) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.