2015
DOI: 10.1002/jcb.25049
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking Database Performance for Genomic Data

Abstract: Genomic regions represent features such as gene annotations, transcription factor binding sites and epigenetic modifications. Performing various genomic operations such as identifying overlapping/non-overlapping regions or nearest gene annotations are common research needs. The data can be saved in a database system for easy management, however, there is no comprehensive database built-in algorithm at present to identify overlapping regions. Therefore I have developed a novel region-mapping (RegMap) SQL-based … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 31 publications
0
7
0
Order By: Relevance
“…Primary tables are given an identity which is used as a foreign key in the secondary tables. We believe this work will help to improve the effectiveness of queries using SQL and future benchmarking efforts [39]. Relational schema generated for such linked tables should be beneficial for replying to better and more complex queries.…”
Section: Discussionmentioning
confidence: 99%
“…Primary tables are given an identity which is used as a foreign key in the secondary tables. We believe this work will help to improve the effectiveness of queries using SQL and future benchmarking efforts [39]. Relational schema generated for such linked tables should be beneficial for replying to better and more complex queries.…”
Section: Discussionmentioning
confidence: 99%
“…MatCol reports colocalisations as the number of objects found to overlap between two channels. This number could be saved in a database which could allow machine learning of large scale analysis among different cell-lines and treatment conditions 26 , 27 . The thresholding method used by MatCol (Equation 1 ) is simple to understand and implement in other tools, if the maintainability of MatCol becomes an issue in the future.…”
Section: Discussionmentioning
confidence: 99%
“…MatQuantify segments an ROI based on its area and can therefore be used to measure 19 structural properties of any organelle at any magnification. The output is saved into a tab-delimited text file, which can be imported into a database for large-scale analysis [ 18 ]. The user needs to know the size of their structure-of-interest, which can be worked out by trial and error.…”
Section: Discussionmentioning
confidence: 99%