2012
DOI: 10.1093/bioinformatics/bts417

SANS: high-throughput retrieval of protein sequences allowing 50% mismatches

Abstract: Motivation: The genomic era in molecular biology has brought on a rapidly widening gap between the amount of sequence data and first-hand experimental characterization of proteins. Fortunately, the theory of evolution provides a simple solution: functional and structural information can be transferred between homologous proteins. Sequence similarity searching followed by k-nearest neighbor classification is the most widely used tool to predict the function or structure of anonymous gene products that come out …

Cited by 22 publications (26 citation statements)
References 22 publications

“…First, MP reads and PacBio reads were aligned against existing scaffolds using BWA [51] and an in-house SANS aligner [52], respectively. After subsequent filtering, the linkage map was used as a guide to determine the most reliable path between the scaffolds to yield individual superscaffolds.…”
Section: Methods (mentioning)
confidence: 99%
“…Although the group members had experience in sequencing and assembling microbial and fungal genomes, performing automatic and manual annotation as well as developing assembly and annotation methods, the M. cinxia genome was the first large genome project for all of us. In addition to the genome and the genome paper, it also yielded new scaffolding, read error correction (Salmela 2010, Salmela & Schröder 2011), functional annotation (Koskinen et al 2015), orthology prediction (Ta et al 2011, Koskinen & Holm 2012), linkage mapping (Rastas et al 2013), and RNA-seq (Kvist et al 2015) and RAD-seq (Rastas et al 2013) library preparation methods. The M. cinxia genome project was also a pioneering project in the sense that it gave many groups in Finland the confidence to initiate a similar investigation as exemplified by the ongoing silver birch (J. Salojärvi unpubl.…”
Section: The Trials and Tribulations Of Building The Melitaea Cinxia (mentioning)
confidence: 99%
“…These workloads are representative for different scientific computing areas, e.g., bioinformatics, astronomy, or geographic information sciences. Let's take for example, a scientist who wants to simulate small angle scattering (SANS) techniques [7] to classify the shape of a molecule. The normal process in this case involves different runs of the simulation, with the scientist checking the simulation results and changing the input parameters after each run.…”
Section: Motivation (mentioning)
confidence: 99%