2016
DOI: 10.1101/043430
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Computational Pan-Genomics: Status, Promises and Challenges

Abstract: Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic datasets. Instead, novel, qualitatively different computational methods and paradigms are… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 169 publications
0
4
0
Order By: Relevance
“…Defining a coordinate system on such graph-based reference genomes has been discussed by Marschall et al [2]. They propose that nearby bases should have similar coordinates (spatiality), that coordinates should be concise and interpretable (readability), and that coordinates should increment along the genome (monotonicity).…”
Section: Resultsmentioning
confidence: 99%
“…Defining a coordinate system on such graph-based reference genomes has been discussed by Marschall et al [2]. They propose that nearby bases should have similar coordinates (spatiality), that coordinates should be concise and interpretable (readability), and that coordinates should increment along the genome (monotonicity).…”
Section: Resultsmentioning
confidence: 99%
“…The fastest algorithm to date was able to process seven whole mammalian genomes in under eight hours [82]. This makes feasible to attack another problem: pan‐genome assembly [83], that is the construction of the assembly graph of several individual genomes.…”
Section: Open Problems and Future Directionsmentioning
confidence: 99%
“…In particular, low-abundance strains can interfere with sequencing errors in common error correction routines. To date, most assembly tools still aim to assemble consensus sequence, if closely related haplotypes are present (Marschall et al, 2016).…”
Section: Introductionmentioning
confidence: 99%