Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington University in St. Louis, is to provide such research opportunities. Using a versatile curriculum that has been adapted to many different class settings, GEP undergraduates undertake projects to bring draft-quality genomic sequence up to high quality and/or participate in the annotation of these sequences. GEP undergraduates have improved more than 2 million bases of draft genomic sequence from several species of Drosophila and have produced hundreds of gene models using evidence-based manual annotation. Students appreciate their ability to make a contribution to ongoing research, and report increased independence and a more active learning approach after participation in GEP projects. They show knowledge gains on pre- and postcourse quizzes about genes and genomes and in bioinformatic analysis. Participating faculty also report professional gains, increased access to genomics-related technology, and an overall positive experience. We have found that using a genomics research project as the core of a laboratory course is rewarding for both faculty and students.
While course-based research in genomics can generate both knowledge gains and a greater appreciation for how science is done, a significant investment of course time is required for students to show gains commensurate with those of a summer research experience. Nonetheless, this is a very cost-effective way to reach larger numbers of students.
The Genomics Education Partnership offers an inclusive model for undergraduate research experiences incorporated into the academic-year science curriculum, with students pooling their work to contribute to international databases.
There have been numerous calls to engage students in science as science is done. A survey of 90-plus faculty members explores barriers and incentives when developing a research-based genomics course. The results indicate that a central core supporting a national experiment can help overcome local obstacles.
The Muller F element (4.2 Mb, ~80 protein-coding genes) is an unusual autosome of Drosophila melanogaster; it is mostly heterochromatic with a low recombination rate. To investigate how these properties impact the evolution of repeats and genes, we manually improved the sequence and annotated the genes on the D. erecta, D. mojavensis, and D. grimshawi F elements and euchromatic domains from the Muller D element. We find that F elements have greater transposon density (25–50%) than euchromatic reference regions (3–11%). Among the F elements, D. grimshawi has the lowest transposon density (particularly DINE-1: 2% vs. 11–27%). F element genes have larger coding spans, more coding exons, larger introns, and lower codon bias. Comparison of the Effective Number of Codons with the Codon Adaptation Index shows that, in contrast to the other species, codon bias in D. grimshawi F element genes can be attributed primarily to selection instead of mutational biases, suggesting that density and types of transposons affect the degree of local heterochromatin formation. F element genes have lower estimated DNA melting temperatures than D element genes, potentially facilitating transcription through heterochromatin. Most F element genes (~90%) have remained on that element, but the F element has smaller syntenic blocks than genome averages (3.4–3.6 vs. 8.4–8.8 genes per block), indicating greater rates of inversion despite lower rates of recombination. Overall, the F element has maintained characteristics that are distinct from other autosomes in the Drosophila lineage, illuminating the constraints imposed by a heterochromatic milieu.
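The Codon Adaptation Index mentioned above can be illustrated with a short calculation. The following is a minimal sketch of the general CAI definition (the geometric mean of per-codon relative adaptiveness values), not the pipeline used in the study. The function names, the toy reference set, and the 0.5 pseudo-count for unobserved codons are assumptions made for illustration.

```python
import math
from collections import Counter

# Standard genetic code, built in TCAG order ('*' marks stop codons).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def split_codons(seq):
    """Split an in-frame coding sequence into complete codons."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def relative_adaptiveness(reference_seqs):
    """Weight each codon by its count relative to the most used synonym.

    Assumption: codons never seen in the reference set get a pseudo-count
    of 0.5; Met, Trp, and stop codons are excluded because they carry no
    information about synonymous codon choice.
    """
    counts = Counter(c for s in reference_seqs for c in split_codons(s))
    weights = {}
    for aa in set(AMINO) - {"M", "W", "*"}:
        synonyms = [c for c, a in CODON_TABLE.items() if a == aa]
        top = max(counts[c] for c in synonyms)
        if top == 0:
            continue  # amino acid absent from the reference set
        for c in synonyms:
            weights[c] = (counts[c] or 0.5) / top
    return weights


def cai(seq, weights):
    """Geometric mean of the weights of all scorable codons in seq."""
    logs = [math.log(weights[c]) for c in split_codons(seq) if c in weights]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))


# Toy reference set (hypothetical): alanine codon GCT is preferred over GCC.
toy_weights = relative_adaptiveness(["GCTGCTGCTGCC"])
print(round(cai("GCTGCC", toy_weights), 3))
```

A gene that uses only the preferred codons of the reference set scores 1.0, and lower values indicate weaker agreement with the reference codon usage.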
The Drosophila Sex Comb on Midleg (SCM) protein is a transcriptional repressor of the Polycomb group (PcG). Although genetic studies establish SCM as a crucial PcG member, its molecular role is not known. To investigate how SCM might link to PcG complexes, we analyzed the in vivo role of a conserved protein interaction module, the SPM domain. This domain is found in SCM and in another PcG protein, Polyhomeotic (PH), which is a core component of Polycomb repressive complex 1 (PRC1). SCM-PH interactions in vitro are mediated by their respective SPM domains. Yeast two-hybrid and in vitro binding assays were used to isolate and characterize >30 missense mutations in the SPM domain of SCM. Genetic rescue assays showed that SCM repressor function in vivo is disrupted by mutations that impair SPM domain interactions in vitro. Furthermore, overexpression of an isolated, wild-type SPM domain produced PcG loss-of-function phenotypes in flies. Coassembly of SCM with a reconstituted PRC1 core complex shows that SCM can partner with PRC1. However, gel filtration chromatography showed that the bulk of SCM is biochemically separable from PH in embryo nuclear extracts. These results suggest that SCM, although not a core component of PRC1, interacts and functions with PRC1 in gene silencing.
The discordance between genome size and the complexity of eukaryotes can partly be attributed to differences in repeat density. The Muller F element (∼5.2 Mb) is the smallest chromosome in Drosophila melanogaster, but it is substantially larger (>18.7 Mb) in D. ananassae. To identify the major contributors to the expansion of the F element and to assess their impact, we improved the genome sequence and annotated the genes in a 1.4-Mb region of the D. ananassae F element, and a 1.7-Mb region from the D element for comparison. We find that transposons (particularly LTR and LINE retrotransposons) are major contributors to this expansion (78.6%), while Wolbachia sequences integrated into the D. ananassae genome are minor contributors (0.02%). Both D. melanogaster and D. ananassae F-element genes exhibit distinct characteristics compared to D-element genes (e.g., larger coding spans, larger introns, more coding exons, and lower codon bias), but these differences are exaggerated in D. ananassae. Compared to D. melanogaster, the codon bias observed in D. ananassae F-element genes can primarily be attributed to mutational biases instead of selection. The 5′ ends of F-element genes in both species are enriched in dimethylation of lysine 4 on histone 3 (H3K4me2), while the coding spans are enriched in H3K9me2. Despite differences in repeat density and gene characteristics, D. ananassae F-element genes show a similar range of expression levels compared to genes in euchromatic domains. This study improves our understanding of how transposons can affect genome size and how genes can function within highly repetitive domains.
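Repeat density comparisons like those above can be read as the fraction of bases in a region that are covered by annotated repeats. The following is a minimal sketch of that calculation, assuming repeat annotations are given as 0-based, half-open (start, end) intervals within a single region. The function name and the example coordinates are hypothetical and are not taken from the study.

```python
def repeat_density(intervals, region_length):
    """Return the fraction of a region covered by at least one repeat.

    Assumptions: intervals are 0-based, half-open (start, end) pairs, as in
    BED-style annotation, and all lie within [0, region_length).
    Overlapping or adjacent intervals are merged so that no base is
    counted twice.
    """
    if region_length <= 0:
        raise ValueError("region_length must be positive")

    covered = 0
    current_start = current_end = None
    for start, end in sorted(intervals):
        if current_end is None or start > current_end:
            # Gap before this interval: close the previous merged block.
            if current_end is not None:
                covered += current_end - current_start
            current_start, current_end = start, end
        else:
            # Overlapping or adjacent: extend the current merged block.
            current_end = max(current_end, end)
    if current_end is not None:
        covered += current_end - current_start

    return covered / region_length


# Hypothetical annotations: two overlapping repeats plus one separate repeat.
example = [(0, 100), (50, 150), (300, 400)]
print(repeat_density(example, 1000))
```

Merging before summing matters because repeat annotations from different tools or nested insertions often overlap, and a naive sum of interval lengths would overstate the density.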
A hallmark of the research experience is encountering difficulty and working through those challenges to achieve success. This ability is essential to being a successful scientist, but replicating such challenges in a teaching setting can be difficult. The Genomics Education Partnership (GEP) is a consortium of faculty who engage their students in a genomics Course-Based Undergraduate Research Experience (CURE). Students participate in genome annotation, generating gene models using multiple lines of experimental evidence. Our observations suggested that the students' learning experience is continuous and recursive, frequently beginning with frustration but eventually leading to success as they come up with defendable gene models. In order to explore our "formative frustration" hypothesis, we gathered data from faculty via a survey, and from students via both a general survey and a set of student focus groups. Upon analyzing these data, we found that all three datasets mentioned frustration and struggle, as well as learning and better understanding of the scientific process. Bioinformatics projects are particularly well suited to the process of iteration and refinement because iterations can be performed quickly and are inexpensive in both time and money. Based on these findings, we suggest that a dynamic of "formative frustration" is an important aspect for a successful CURE.