2020
DOI: 10.1093/database/baaa064

A content-based dataset recommendation system for researchers—a case study on Gene Expression Omnibus (GEO) repository

Abstract: It is a growing trend among researchers to make their data publicly available for experimental reproducibility and data reusability. Sharing data with fellow researchers helps in increasing the visibility of the work. On the other hand, there are researchers who are inhibited by the lack of data resources. To overcome this challenge, many repositories and knowledge bases have been established to date to ease data sharing. Further, in the past two decades, there has been an exponential increase in the number of…

Cited by 32 publications (37 citation statements)
References 22 publications
“…The draft genome of C. intestinihominis AF73-05CM02 T has been deposited at DDBJ/EMBL/GenBank under the accession number MAIQ00000000. The data that support the findings of this study have also been deposited into CNGB Sequence Archive (CNSA) (Guo et al, 2020) of China National GeneBank DataBase (CNGBdb) (Chen et al, 2020) with accession number CNPhis0003415.…”
Section: Data Availability Statement
Citation type: mentioning
confidence: 72%
“…FamDB files contain family consensi/HMMs and the NCBI Taxonomy data related to these families in a format that allows for fast offline access from the command line. The current release of FamDB includes all Dfam consensus sequences, HMMs, metadata, and 61,003 taxa from NCBI’s taxonomy database [46] related to these families. Lookups for information on a single taxon or family complete in about a second; extraction of consensus sequences (FASTA, EMBL) or HMMs for all TE families found in Human (including ancestral repeats) complete in about 3 to 4 s. Due to indexing, the run time for data queries is largely independent of the total number of TEs in the database: it takes about the same amount of time to extract the human library from a FamDB file including only the curated subset of Dfam (6915 entries) as for the full database (273,655).…”
Section: Software/tool Distribution Improvements
Citation type: mentioning
confidence: 99%
“…In the RA field, a fully annotated, expert validated, state-of-the-art knowledge base in the form of a molecular map has been published recently, illustrating the molecular and signaling pathways involved in disease pathogenesis [183, 184]. However, this map is not cell-specific as it includes experiments in different cell types such as mononuclear cells, synovial fibroblasts, macrophages and chondrocytes.…”
Section: Computational Systems Biology Approaches
Citation type: mentioning
confidence: 99%
“…In RA, the recently published RA map [184] can serve as a basis for the building of a regulatory graph and the associated logical model. Initially, researchers were set to build a large-scale boolean dynamical model for the study of RA fibroblasts’ activation based on the RA map and a previously published, more generic model on fibroblasts [213].…”
Section: Computational Systems Biology Approaches
Citation type: mentioning
confidence: 99%