Service Email Alerting click here. top right corner of the article or Receive free email alerts when new articles cite this article-sign up in the box at the http://genome.cshlp.org/subscriptions
Common bean ( L.) is the most important grain legume for human consumption and is a major nutrition source in the tropics. Because bean production is reduced by both abiotic and biotic constraints, current breeding efforts are focused on the development of improved varieties with tolerance to these stresses. We characterized materials from different breeding programs spanning three continents to understand their sequence diversity and advance the development of molecular breeding tools. For this, 37 varieties belonging to , (A. Gray), and L. were sequenced by whole-genome sequencing, identifying more than 40 million genomic variants. Evaluation of nuclear DNA content and analysis of copy number variation revealed important differences in genomic content not only between and the two other domesticated species, but also within , affecting hundreds of protein-coding genomic regions. A large number of inter-gene pool introgressions were identified. Furthermore, interspecific introgressions for disease resistance in breeding lines were mapped. Evaluation of newly developed single nucleotide polymorphism markers within previously discovered quantitative trait loci for common bacterial blight and angular leaf spot provides improved specificity to tag sources of resistance to these diseases. We expect that this dataset will provide a deeper molecular understanding of breeding germplasm and deliver molecular tools for germplasm development, aiming to increase the efficiency of bean breeding programs.
Background Common bean is an important staple crop in the tropics of Africa, Asia and the Americas. Particularly smallholder farmers rely on bean as a source for calories, protein and micronutrients. Drought is a major production constraint for common bean, a situation that will be aggravated with current climate change scenarios. In this context, new tools designed to understand the genetic basis governing the phenotypic responses to abiotic stress are required to improve transfer of desirable traits into cultivated beans. Results A multiparent advanced generation intercross (MAGIC) population of common bean was generated from eight Mesoamerican breeding lines representing the phenotypic and genotypic diversity of the CIAT Mesoamerican breeding program. This population was assessed under drought conditions in two field trials for yield, 100 seed weight, iron and zinc accumulation, phenology and pod harvest index. Transgressive segregation was observed for most of these traits. Yield was positively correlated with yield components and pod harvest index (PHI), and negative correlations were found with phenology traits and micromineral contents. Founder haplotypes in the population were identified using Genotyping by Sequencing (GBS). No major population structure was observed in the population. Whole Genome Sequencing (WGS) data from the founder lines was used to impute genotyping data for GWAS. Genetic mapping was carried out with two methods, using association mapping with GWAS, and linkage mapping with haplotype-based interval screening. Thirteen high confidence QTL were identified using both methods and several QTL hotspots were found controlling multiple traits. A major QTL hotspot located on chromosome Pv01 for phenology traits and yield was identified. Further hotspots affecting several traits were observed on chromosomes Pv03 and Pv08. A major QTL for seed Fe content was contributed by MIB778, the founder line with highest micromineral accumulation. Based on imputed WGS data, candidate genes are reported for the identified major QTL, and sequence changes were identified that could cause the phenotypic variation. Conclusions This work demonstrates the importance of this common bean MAGIC population for genetic mapping of agronomic traits, to identify trait associations for molecular breeding tool design and as a new genetic resource for the bean research community.
BackgroundThe apomictic reproductive mode of Brachiaria (syn. Urochloa) forage species allows breeders to faithfully propagate heterozygous genotypes through seed over multiple generations. In Brachiaria, reproductive mode segregates as single dominant locus, the apospory-specific genomic region (ASGR). The AGSR has been mapped to an area of reduced recombination on Brachiaria decumbens chromosome 5. A primer pair designed within ASGR-BABY BOOM-like (BBML), the candidate gene for the parthenogenesis component of apomixis in Pennisetum squamulatum, was diagnostic for reproductive mode in the closely related species B. ruziziensis, B. brizantha, and B. decumbens. In this study, we used a mapping population of the distantly related commercial species B. humidicola to map the ASGR and test for conservation of ASGR-BBML sequences across Brachiaria species.ResultsDense genetic maps were constructed for the maternal and paternal genomes of a hexaploid (2n = 6x = 36) B. humidicola F1 mapping population (n = 102) using genotyping-by-sequencing, simple sequence repeat, amplified fragment length polymorphism, and transcriptome derived single nucleotide polymorphism markers. Comparative genomics with Setaria italica provided confirmation for x = 6 as the base chromosome number of B. humidicola. High resolution molecular karyotyping indicated that the six homologous chromosomes of the sexual female parent paired at random, whereas preferential pairing of subgenomes was observed in the apomictic male parent. Furthermore, evidence for compensated aneuploidy was found in the apomictic parent, with only five homologous linkage groups identified for chromosome 5 and seven homologous linkage groups of chromosome 6. The ASGR mapped to B. humidicola chromosome 1, a region syntenic with chromosomes 1 and 7 of S. italica. The ASGR-BBML specific PCR product cosegregated with the ASGR in the F1 mapping population, despite its location on a different carrier chromosome than B. decumbens.ConclusionsThe first dense molecular maps of B. humidicola provide strong support for cytogenetic evidence indicating a base chromosome number of six in this species. Furthermore, these results show conservation of the ASGR across the Paniceae in different chromosomal backgrounds and support postulation of the ASGR-BBML as candidate genes for the parthenogenesis component of apomixis.Electronic supplementary materialThe online version of this article (10.1186/s12864-018-5392-4) contains supplementary material, which is available to authorized users.
BackgroundTherecent development and availability of different genotype by sequencing (GBS) protocols provided a cost-effective approach to perform high-resolution genomic analysis of entire populations in different species. The central component of all these protocols is the digestion of the initial DNA with known restriction enzymes, to generate sequencing fragments at predictable and reproducible sites. This allows to genotype thousands of genetic markers on populations with hundreds of individuals. Because GBS protocols achieve parallel genotyping through high throughput sequencing (HTS), every GBS protocol must include a bioinformatics pipeline for analysis of HTS data. Our bioinformatics group recently developed the Next Generation Sequencing Eclipse Plugin (NGSEP) for accurate, efficient, and user-friendly analysis of HTS data.ResultsHere we present the latest functionalities implemented in NGSEP in the context of the analysis of GBS data. We implemented a one step wizard to perform parallel read alignment, variants identification and genotyping from HTS reads sequenced from entire populations. We added different filters for variants, samples and genotype calls as well as calculation of summary statistics overall and per sample, and diversity statistics per site. NGSEP includes a module to translate genotype calls to some of the most widely used input formats for integration with several tools to perform downstream analyses such as population structure analysis, construction of genetic maps, genetic mapping of complex traits and phenotype prediction for genomic selection. We assessed the accuracy of NGSEP on two highly heterozygous F1 cassava populations and on an inbred common bean population, and we showed that NGSEP provides similar or better accuracy compared to other widely used software packages for variants detection such as GATK, Samtools and Tassel.ConclusionsNGSEP is a powerful, accurate and efficient bioinformatics software tool for analysis of HTS data, and also one of the best bioinformatic packages to facilitate the analysis and to maximize the genomic variability information that can be obtained from GBS experiments for population genomics.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-2827-7) contains supplementary material, which is available to authorized users.
Investigations using Electronic Health Records (EHR) databases could enable accurate delineation of psychiatric disease trajectories at an unprecedented scale. Using EHR from a single institution (Clinica San Juan de Dios in Manizales, Colombia), we characterize diagnostic trajectories of >22,000 (ages 4-90, 60% female) individuals treated for severe mental illness (SMI), including schizophrenia (SCZ), bipolar disorder (BD), and severe or recurrent major depressive disorder (MDD). We extracted diagnostic codes, clinical notes, and healthcare use data collected since 2005. Using a subsample of 105 SMI patients, we assessed diagnostic reliability, comparing EHR to clinical chart review. EHR diagnostic codes showed very good agreement with chart review diagnoses (Cohens kappa 0.78). Using 3,600 annotated sentences from 2,788 patients, we developed a pipeline for extracting clinical features from the electronic text, which showed high agreement with gold-standard annotations (average F1 0.88). Factors associated with diagnostic instability, defined as changes in diagnosis between successive visits, were identified using mixed-effect logistic regression models. Of SMI patients with >3 visits (n=12,962), 64% had multiple EHR diagnoses; diagnostic switches (19%), comorbidities (30%), and both (15%). While some diagnostic switches are common, such as the switch from MDD to BD (observed in 22% of BD patients), trajectories are highly heterogeneous, with rare trajectories (occurring in <1% of patients) making up the majority (58% of all patients). Predictors of diagnostic instability include time since initial visit (OR 0.56 by visit number, p-value 2e-66), previous diagnostic change (OR= 4.02, p-value 3e-250) and NLP-derived descriptions of delusions (OR 1.50, p-values 2e-18). Our results underline the importance of considering longitudinal rather than cross-sectional diagnoses in psychiatric research and show how high-quality EHR data can contribute to global efforts to understand disease trajectories.
Bipolar disorder (BD) is a heritable disorder characterized by shifts in mood that manifest in manic or depressive episodes. Clinical studies have identified abnormalities of the circadian system in BD patients as a hallmark of underlying pathophysiology. Fibroblast cells collected from BD patients show temporal differences in the expression of core circadian genes. We set out to examine the underlying genetic architecture of circadian rhythm in an in vitro fibroblast model, in order to disentangle the polygenic nature of BD disease risk. We collected, from primary cell lines of 6 healthy individuals, temporal genomic features over a 48 hour period from transcriptomic data (RNA-seq) and open chromatin data (ATAC-seq). The RNA-seq data showed that only a limited number of genes, such asARNTL, CRY1, PER3, NR1D2andTEFdisplay circadian patterns of expression consistently across cell cultures. The ATAC-seq data identified that distinct transcription factor families, like those with the basic helix-loop-helix motif, were associated with regions that were increasing in accessibility over time. Further evaluation of these regions using stratified linkage disequilibrium score regression (sLDSC) analysis failed to identify a significant presence of them in the known genetic architecture of BD, and other psychiatric disorders or neurobehavioral traits in which the circadian rhythm is affected. This study characterizes the biological pathways that are activated in this in vitro circadian model, evaluating the relevance of these processes in the context of the genetic architecture of BD and other disorders, highlighting its limitations and future applications for circadian genomic studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.