2010
DOI: 10.1007/s11390-010-9304-6
|View full text |Cite
|
Sign up to set email alerts
|

Some Algorithmic Challenges in Genome-Wide Ortholog Assignment

Abstract: Genome-scale assignment of orthologous genes is a fundamental and challenging problem in computational biology and has a wide range of applications in comparative genomics, functional genomics, and systems biology. Many methods based on sequence similarity, phylogenetic analysis, chromosomal syntenic information, and genome rearrangement have been proposed in recent years for ortholog assignment. Although these methods produce results that largely agree with each other, their results may still contain signific… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2010
2010
2014
2014

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 50 publications
0
11
0
Order By: Relevance
“…Other approaches maximize the number of genes matched in each family [29,3]. A more general method allowing all gene copies to be kept, and accounting for reversals, translocations, fusions and fissions, has been developed and implemented in the MSOAR software [46,68,104]. Finding the most parsimonious rearrangement process transforming one genome into another constructs, as a byproduct, the list of orthologous gene pairs.…”
Section: Gene Familiesmentioning
confidence: 99%
“…Other approaches maximize the number of genes matched in each family [29,3]. A more general method allowing all gene copies to be kept, and accounting for reversals, translocations, fusions and fissions, has been developed and implemented in the MSOAR software [46,68,104]. Finding the most parsimonious rearrangement process transforming one genome into another constructs, as a byproduct, the list of orthologous gene pairs.…”
Section: Gene Familiesmentioning
confidence: 99%
“…Incorporating this fact into the mathematics of genome comparison and genome reconstruction complicates the formulation of problems and inevitably worsens their complexity. For this introductory essay, therefore, we will keep largely to the single-copy genes case, and leave it to the reader to explore the voluminous literature (starting with [22]) that attempts to generalize to the multicopy case. We will touch on the latter occasionally, especially in Section 4 on genome halving, a context where there are exactly two copies of each gene.…”
Section: Genomesmentioning
confidence: 99%
“…A typical problem involves ambiguous homology or paralogy, due to WGD and other duplication processes, leading to the risk of matching up inappropriate pairs of genes as orthologs in the two genomes [22] . These problems tend to artifactually disrupt long runs of consecutive genes in both genomes, and increase the number of shorter runs, often consisting of only one or two genes.…”
Section: Noisy Genomesmentioning
confidence: 99%
See 1 more Smart Citation
“…More precisely, the reversal diameter of strings of length n is n − max a∈Σ #(a) where #(a) denotes the number of occurrences of letter a in either input string. Signed String Reversal Distance has applications in the identification of orthologous genes across two genomes [5,10,12]. Signed String Reversal Distance is NP-hard if each letter occurs at most twice [5] and for binary signed strings [13].…”
Section: Introductionmentioning
confidence: 99%