2016 IEEE International Conference on Big Data (Big Data) 2016
DOI: 10.1109/bigdata.2016.7840727

Scalable genomics: From raw data to aligned reads on Apache YARN

Cited by 5 publications
(4 citation statements)
References 37 publications
“…The first module (preprocessor) reads as input raw Illumina data and performs BCL conversion, filtering and demultiplexing. The second module implements the alignment step in Flink, using the Read Aligner API (RAPI [5]) which provides Java bindings for the BWA-MEM aligner. The two modules are connected by a Kafka broker.…”
Section: Methods (mentioning)
confidence: 99%
“…Note: This expository section is reproduced almost verbatim from [15], for the reader's convenience.…”
Section: The NGS Process (mentioning)
confidence: 99%
“…The first module in our pipeline takes care of preprocessing the raw Illumina data, which are available in the proprietary BCL format. To construct the preprocessor we extended our BCL to FASTQ converter [15], by enabling its output to be sent to a Kafka broker, using the built-in Flink-Kafka connector.…”
Section: A Data Preprocessing (mentioning)
confidence: 99%