2010
DOI: 10.1504/ijdmb.2010.033520
|View full text |Cite
|
Sign up to set email alerts
|

Detecting distant homologies on protozoans metabolic pathways using scientific workflows

Abstract: Bioinformatics experiments are typically composed of programs in pipelines manipulating an enormous quantity of data. An interesting approach for managing those experiments is through workflow management systems (WfMS). In this work we discuss WfMS features to support genome homology workflows and present some relevant issues for typical genomic experiments. Our evaluation used Kepler WfMS to manage a real genomic pipeline, named OrthoSearch, originally defined as a Perl script. We show a case study detecting … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 47 publications
0
7
0
Order By: Relevance
“…OrthoSearch (Cruz et al, 2010) is a scientific workflow developed under VisTrails Scientific Workflow Management System (Callahan et al, 2006). It was used to infer orthologous groups between protozoa and COG/KOG.…”
Section: Orthology Identificationmentioning
confidence: 99%
See 1 more Smart Citation
“…OrthoSearch (Cruz et al, 2010) is a scientific workflow developed under VisTrails Scientific Workflow Management System (Callahan et al, 2006). It was used to infer orthologous groups between protozoa and COG/KOG.…”
Section: Orthology Identificationmentioning
confidence: 99%
“…While the blast-based OrthoMCL software (Li et al, 2003) was used to infer orthologs among protozoa species, OrthoSearch (Cruz et al, 2010) was used to infer ProtozoaUnicellular and Protozoa-Eukaryote orthologs. The latter software was originally designed as a scientific workflow for orthology inference using an HMM-based approach.…”
mentioning
confidence: 99%
“…In this sub-section, we describe how we used OrthoSearch [10], a genomic workflow, oriented to analyze large volumes of biological data on genomic databases like COG/KOG [30], to detect distant homologs genes in the parasites that cause neglected tropical diseases (NTD). OrthoSearch was chosen due to two key characteristics.…”
Section: Case Study: a Scientific Workflow To Detect Homologiesmentioning
confidence: 99%
“…OrthoSearch is clearly a data-massive application, both computationally and from a workflow perspective [10]. In order to compare the overhead of Matriohska we prepared six configurations (1 to 32 instances) to execute both with the fragment of OrthoSearch and a similar pipeline with the same sequence of bioinformatics packages.…”
Section: Initial Evaluation Of Matriohskamentioning
confidence: 99%
“…Besides a simple categorization effort [ 26 ], we will follow Dalquen’s proposition [ 29 ]. Briefly, three distinct approaches are available: (i) the one which use multiple sequence alignment (MSA) scores along with reciprocal best hits, such as OrthoSearch [ 30 ], OrthoMCL [ 24 ] and InParanoid [ 31 ]; (ii) that which rely on evolutionary distance calculus, as RSD [ 32 , 33 ]; (iii) and that based on phylogenetic trees reconstruction, as SPIMAP [ 34 ].…”
Section: Introductionmentioning
confidence: 99%