Echinobase (www.echinobase.org) is a third generation web resource supporting genomic research on echinoderms. The new version was built by cloning the mature Xenopus model organism knowledgebase, Xenbase, refactoring data ingestion pipelines and modifying the user interface to adapt to multispecies echinoderm content. This approach leveraged over 15 years of previous database and web application development to generate a new fully featured informatics resource in a single year. In addition to the software stack, Echinobase uses the private cloud and physical hosts that support Xenbase. Echinobase currently supports six echinoderm species, focused on those used for genomics, developmental biology and gene regulatory network analyses. Over 38 000 gene pages, 18 000 publications, new improved genome assemblies, JBrowse genome browser and BLAST + services are available and supported by the development of a new echinoderm anatomical ontology, uniformly applied formal gene nomenclature, and consistent orthology predictions. A novel feature of Echinobase is integrating support for multiple, disparate species. New genomes from the diverse echinoderm phylum will be added and supported as data becomes available. The common code development design of the integrated knowledgebases ensures parallel improvements as each resource evolves. This approach is widely applicable for developing new model organism informatics resources.
The Alliance of Genome Resources (the Alliance) is a combined effort of 7 knowledgebase projects: Saccharomyces Genome Database, WormBase, FlyBase, Mouse Genome Database, the Zebrafish Information Network, Rat Genome Database, and the Gene Ontology Resource. The Alliance seeks to provide several benefits: better service to the various communities served by these projects; a harmonized view of data for all biomedical researchers, bioinformaticians, clinicians, and students; and a more sustainable infrastructure. The Alliance has harmonized cross-organism data to provide useful comparative views of gene function, gene expression, and human disease relevance. The basis of the comparative views is shared calls of orthology relationships and the use of common ontologies. The key types of data are alleles and variants, gene function based on gene ontology annotations, phenotypes, association to human disease, gene expression, protein–protein and genetic interactions, and participation in pathways. The information is presented on uniform gene pages that allow facile summarization of information about each gene in each of the 7 organisms covered (budding yeast, roundworm Caenorhabditis elegans, fruit fly, house mouse, zebrafish, brown rat, and human). The harmonized knowledge is freely available on the alliancegenome.org portal, as downloadable files, and by APIs. We expect other existing and emerging knowledge bases to join in the effort to provide the union of useful data and features that each knowledge base currently provides.
The study of biogeography seeks taxa that share a key set of characteristics, such as timescale of diversification, dispersal ability, and ecological lability. Tarantulas are ideal organisms for studying evolution over continental-scale biogeography given their time period of diversification, their mostly long-lived sedentary lives, low dispersal rate, and their nevertheless wide circumtropical distribution. In tandem with a time-calibrated transcriptome-based phylogeny generated by PhyloBayes, we estimate the ancestral ranges of ancient tarantulas using two methods, DEC+j and BBM, in the context of their evolution. We recover two ecologically distinct tarantula lineages that evolved on the Indian Plate before it collided with Asia, emphasizing the evolutionary significance of the region, and show that both lineages diversified across Asia at different times. The most ancestral tarantulas emerge on the Americas and Africa 120 Ma–105.5 Ma. We provide support for a dual colonization of Asia by two different tarantula lineages that occur at least 20 million years apart, as well as a Gondwanan origin for the group. We determine that their current distributions are attributable to a combination of Gondwanan vicariance, continental rafting, and geographic radiation. We also discuss emergent patterns in tarantula habitat preferences through time.
Xenbase (https://www.xenbase.org/), the Xenopus model organism knowledgebase, is a web accessible resource that integrates the diverse genomic and biological data from research on the laboratory frogs Xenopus laevis and Xenopus tropicalis. The goal of Xenbase is to accelerate discovery and empower Xenopus research, enhance the impact of Xenopus research data and to facilitate the dissemination of these data. Xenbase also enhances the value of Xenopus data through high quality curation, data integration, providing bioinformatics tools optimized for Xenopus experiments, and by linking Xenopus data to humans and other model organisms. Xenbase also plays an indispensable role in making Xenopus data interoperable and accessible to the broader biomedical community in accordance with FAIR principles. Xenbase provides annotated data updates to organizations such as NCBI, UniProtKB, Ensembl, the Gene Ontology consortium and mostly recently, the Alliance of Genomic Resources, a common clearing house for data from humans and model organisms. This article provides a brief overview of key and recently added features of Xenbase. New features include processing of Xenopus high throughput sequencing data from the NCBI Gene Expression Omnibus, curation of anatomical, physiological and expression phenotypes with the newly created Xenopus Phenotype Ontology, Xenopus Gene Ontology annotations, new anatomical drawings of the Normal Table of Xenopus development, and integration of the latest Xenopus laevis v10.1 genome annotations. Finally, we highlight areas for future development at Xenbase as we continue to support the Xenopus research community.
Echinobase (https://echinobase.org) is a central online platform that generates, manages and hosts genomic data relevant to echinoderm research. While the resource primarily serves the echinoderm research community, the recent release of an excellent quality genome for the frequently studied purple sea urchin (Strongylocentrotus purpuratus genome, v5.0) has provided an opportunity to adapt to the needs of a broader research community across other model systems. To this end, establishing pipelines to identify orthologous genes between echinoderms and other species has become a priority in many contexts including nomenclature, linking to data in other model organisms, and in internal functionality where data gathered in one hosted species can be associated with genes in other hosted echinoderms. This paper describes the orthology pipelines currently employed by Echinobase and how orthology data are processed to yield 1:1 ortholog mappings between a variety of echinoderms and other model taxa. We also describe functions of interest that have recently been included on the resource, including an updated developmental time course for S.purpuratus, and additional tracks for genome browsing. These data enhancements will increase the accessibility of the resource to non-echinoderm researchers and simultaneously expand the data quality and quantity available to core Echinobase users. Database URL: https://echinobase.org
A keyword-based search of comprehensive databases such as PubMed may return irrelevant papers, especially if the keywords are used in multiple fields of study. In such cases, domain experts (curators) need to verify the results and remove the irrelevant articles. Automating this filtering process will save time, but it has to be done well enough to ensure few relevant papers are rejected and few irrelevant papers are accepted. A good solution would be fast, work with the limited amount of data freely available (full paper body may be missing), handle ambiguous keywords and be as domain-neutral as possible. In this paper, we evaluate a number of classification algorithms for identifying a domain-specific set of papers about echinoderm species and show that the resulting tool satisfies most of the abovementioned requirements. Echinoderms consist of a number of very different organisms, including brittle stars, sea stars (starfish), sea urchins and sea cucumbers. While their taxonomic identifiers are specific, the common names are used in many other contexts, creating ambiguity and making a keyword search prone to error. We try classifiers using Linear, Naïve Bayes, Nearest Neighbor, Tree, SVM, Bagging, AdaBoost and Neural Network learning models and compare their performance. We show how effective the resulting classifiers are in filtering irrelevant articles returned from PubMed. The methodology used is more dependent on the good selection of training data and is a practical solution that can be applied to other fields of study facing similar challenges. Database URL : The code and date reported in this paper are freely available at http://xenbaseturbofrog.org/pub/Text-Topic-Classifier/
Tarantulas paradoxically exhibit a diverse palette of vivid coloration despite their crepuscular to nocturnal habits. The evolutionary origin and maintenance of these colours remains mysterious. In this study, we reconstructed the ancestral states of both blue and green coloration in tarantula setae, and tested how these colours correlate with presence of stridulation, urtication and arboreality. Green coloration has probably evolved at least eight times, and blue coloration is probably an ancestral condition that appears to be lost more frequently than gained. While our results indicate that neither colour correlates with the presence of stridulation or urtication, the evolution of green coloration appears to depend upon the presence of arboreality, suggesting that it ptobably originated for and functions in crypsis through substrate matching among leaves. We also constructed a network of opsin homologues across tarantula transcriptomes. Despite their crepuscular tendencies, tarantulas express a considerable diversity of opsin genes—a finding that contradicts current consensus that tarantulas have poor colour vision on the basis of low opsin diversity. Overall, our findings raise the possibility that blue coloration could have ultimately evolved via sexual selection and perhaps proximately be used in mate choice or predation avoidance due to possible sex differences in mate-searching.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.