Today’s science increasingly requires effective ways to find and access existing datasets that are distributed across a range of repositories. For researchers in the life sciences, discoverability of datasets may soon become as essential as identifying the latest publications via PubMed. Through an international collaborative effort funded by the National Institutes of Health (NIH)’s Big Data to Knowledge (BD2K) initiative, we have designed and implemented the DAta Tag Suite (DATS) model to support the DataMed data discovery index. DataMed’s goal is to be for data what PubMed has been for the scientific literature. Akin to the Journal Article Tag Suite (JATS) used in PubMed, the DATS model enables submission of metadata on datasets to DataMed. DATS has a core set of elements, which are generic and applicable to any type of dataset, and an extended set that can accommodate more specialized data types. DATS is a platform-independent model also available as an annotated serialization in schema.org, which in turn is widely used by major search engines like Google, Microsoft, Yahoo and Yandex.
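As a rough illustration of what a schema.org-style dataset description can look like, the sketch below builds a small JSON-LD record in Python. It uses only generic schema.org `Dataset` vocabulary (`name`, `description`, `identifier`, `keywords`, `distribution`); it is not the DATS schema itself, and the helper name `make_dataset_record`, the identifier, and the example.org URL are hypothetical placeholders introduced for this example.

```python
import json


def make_dataset_record(name, description, identifier, keywords, download_url):
    """Build a schema.org-style JSON-LD description of a dataset.

    Hypothetical helper for illustration only; it does not reproduce
    the actual DATS core or extended element names.
    """
    return {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "identifier": identifier,
        "keywords": list(keywords),
        # A dataset may be retrievable from one or more locations.
        "distribution": {
            "@type": "DataDownload",
            "contentUrl": download_url,
        },
    }


# All values below are made up for the example.
record = make_dataset_record(
    name="Example expression dataset",
    description="Placeholder description of a hypothetical dataset.",
    identifier="example-0001",
    keywords=["transcriptome", "example"],
    download_url="https://example.org/data/example-0001.csv",
)

print(json.dumps(record, indent=2))
```

A record of this shape can be embedded in a dataset landing page so that crawlers recognizing schema.org markup can index it.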
Background: Sporopollenin is a major component of the pollen exine pattern. In Arabidopsis, acyl-CoA synthetase5 (ACOS5) is involved in sporopollenin precursor biosynthesis. In this study, we identified its orthologue, OsACOS12, in rice (Oryza sativa) and examined the functional conservation of ACOS between rice and Arabidopsis. Results: Sequence analysis showed that OsACOS12 shares 63.9% amino acid sequence identity with ACOS5. The osacos12 mutant, which carries a premature stop codon in LOC_Os04g24530, exhibits a defective sexine, resulting in a male-sterile phenotype in rice. In situ hybridization showed that OsACOS12 transcripts are present in tapetal cells and microspores. Localization of OsACOS12-GFP demonstrated that the OsACOS12 protein accumulates in tapetal cells and anther locules. OsACOS12 driven by the ACOS5 promoter could partially restore the male fertility of the acos5 mutant in Arabidopsis. Conclusions: OsACOS12 is an orthologue of ACOS5 that is essential for sporopollenin synthesis in rice. ACOS5 and OsACOS12 are conserved for pollen wall formation in monocot and dicot species. Electronic supplementary material: The online version of this article (doi:10.1186/s12870-016-0943-9) contains supplementary material, which is available to authorized users.
Our manual review showed that the ingestion pipeline achieved an accuracy of 90% and that the frequency of core DATS elements varied across repositories. On a manually curated benchmark dataset, the DataMed search engine, which implements advanced natural language processing and terminology services, achieved an inferred average precision of 0.2033 and a precision at 10 (P@10, the fraction of relevant results among the top 10 search results) of 0.6022. We have made the DataMed system publicly available as an open source package for the biomedical community.
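To make the P@10 figure concrete, here is a minimal sketch of how precision at k is conventionally computed for one query and then averaged over queries. A score such as 0.6022 corresponds to the count of relevant results in the top 10 divided by 10, averaged across the benchmark queries. The function names and the toy document IDs are assumptions made for this example; this is not the DataMed evaluation code, and it does not implement inferred average precision, which is a separate metric.

```python
def precision_at_k(ranked_ids, relevant_ids, k=10):
    """Fraction of the top-k ranked results that are judged relevant."""
    if k <= 0:
        raise ValueError("k must be a positive integer")
    top_k = ranked_ids[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant_ids)
    # Dividing by k (not len(top_k)) penalizes result lists shorter than k.
    return hits / k


def mean_precision_at_k(runs, k=10):
    """Average precision at k over (ranked_ids, relevant_ids) pairs, one per query."""
    if not runs:
        raise ValueError("at least one query is required")
    return sum(precision_at_k(r, rel, k) for r, rel in runs) / len(runs)


# Toy data: one query with 10 returned datasets, 6 of them judged relevant.
ranked = ["d1", "d2", "d3", "d4", "d5", "d6", "d7", "d8", "d9", "d10"]
relevant = {"d1", "d3", "d4", "d7", "d9", "d10"}

p10 = precision_at_k(ranked, relevant, k=10)
print(p10)
```

Dividing by k rather than by the number of returned results is one common convention; evaluations that handle short result lists differently will produce slightly different scores.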
To assess changes in metabolite content and the regulatory mechanism of the phenolic acid biosynthesis pathway during leaf development, this study performed a combined metabolome and transcriptome analysis of Cyclocarya paliurus leaves at different developmental stages. Metabolite and transcript profiling were conducted by ultra-performance liquid chromatography quadrupole time-of-flight tandem mass spectrometry and high-throughput RNA sequencing, respectively. Transcriptome analysis identified 58 genes involved in the biosynthesis of phenolic acid. Among them, 10 differentially expressed genes were detected between every pair of developmental stages. Identification and quantification of metabolites indicated that 14 metabolites were located in the phenolic acid biosynthetic pathway. Among them, eight differentially accumulated metabolites were detected between every pair of developmental stages. Association analysis between the metabolome and transcriptome showed that six differentially expressed structural genes were significantly positively correlated with metabolite accumulation and showed similar expression trends. A total of 128 transcription factors that may be involved in the regulation of phenolic acid biosynthesis were identified, including 12 MYBs and 10 basic helix–loop–helix (bHLH) transcription factors. A regulatory network of phenolic acid biosynthesis was established to visualize the differentially expressed candidate genes involved in the accumulation of significantly different metabolites. The results of this study contribute to a further understanding of phenolic acid biosynthesis during leaf development in C. paliurus.
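The text does not say which statistic was used for the association analysis between gene expression and metabolite accumulation. As one common choice, the sketch below computes a Pearson correlation coefficient between a gene's expression and a metabolite's abundance across developmental stages. The function name `pearson_r` and all numeric values are hypothetical, chosen only to illustrate the calculation; they are not data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Made-up values across four developmental stages (not data from the study).
gene_expression = [1.0, 2.0, 4.0, 8.0]       # e.g. normalized transcript level
metabolite_abundance = [0.5, 1.1, 2.1, 3.9]  # e.g. relative metabolite content

r = pearson_r(gene_expression, metabolite_abundance)
print(round(r, 3))
```

A coefficient close to 1 indicates that expression and accumulation rise together across stages; judging significance would additionally require a statistical test and, with many gene-metabolite pairs, a multiple-testing correction.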