2010
DOI: 10.1093/nar/gkq1019
|View full text |Cite
|
Sign up to set email alerts
|

The Sequence Read Archive

Abstract: The combination of significantly lower cost and increased speed of sequencing has resulted in an explosive growth of data submitted into the primary next-generation sequence data archive, the Sequence Read Archive (SRA). The preservation of experimental data is an important part of the scientific record, and increasing numbers of journals and funding agencies require that next-generation sequence data are deposited into the SRA. The SRA was established as a public repository for the next-generation sequence da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
1,788
0
3

Year Published

2011
2011
2021
2021

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 2,402 publications
(1,893 citation statements)
references
References 9 publications
3
1,788
0
3
Order By: Relevance
“…Libraries were downloaded from NCBI Sequence Read Archive (SRA) [82] and converted to FASTQ format using either SRAdb v1.40.0 [83] or fastq-dump v2.8.2 [82]. We preprocessed paired-end and single-end libraries using Trimmomatic v0.36 [84] in order to trim known adapters and/or low quality ends.…”
Section: Methodsmentioning
confidence: 99%
“…Libraries were downloaded from NCBI Sequence Read Archive (SRA) [82] and converted to FASTQ format using either SRAdb v1.40.0 [83] or fastq-dump v2.8.2 [82]. We preprocessed paired-end and single-end libraries using Trimmomatic v0.36 [84] in order to trim known adapters and/or low quality ends.…”
Section: Methodsmentioning
confidence: 99%
“…Mining public databases to elucidate the distribution of the 'Diapherotrites' in nature We attempted to identify the occurrence of members of the CP-'Diapherotrites' in various ecosystems by mining metagenomic data sets in the IMG database (n ¼ 893, accessed in December 2013), Sangergenerated 16S rRNA gene sequences in the nr database (n ¼ 53 65 062 sequences, accessed in January 2014) and partial, high-throughput (pyrosequencing and Illumina)-generated archaeal 16S rRNA gene sequences in MG-RAST (Meyer et al, 2008) and SRA archive (Leinonen et al, 2011) (n ¼ 31 972 882 sequences in 775 data sets generated using archaeal primers). Identification of CP-'Diapherotrites' in metagenomic data sets was conducted using the three 'Diapherotrites' SAG assemblies for anchoring metagenomic reads as previously described (Rinke et al, 2013).…”
Section: Principal Component Analysis (Pca)mentioning
confidence: 99%
“…In this article I describe an approach how it is possible to process ChIP-seq data from different experiments automatically, starting either from the SRA format files from NCBI [2], or FASTQ format files, or BAM format files which contain aligned reads. Among the many different available genome alignment tools I use the BWA.…”
Section: Introductionmentioning
confidence: 99%