2019
DOI: 10.1093/nar/gkz1063
|View full text |Cite
|
Sign up to set email alerts
|

The European Nucleotide Archive in 2019

Abstract: The European Nucleotide Archive (ENA, https://www.ebi.ac.uk/ena) at the European Molecular Biology Laboratory’s European Bioinformatics Institute provides open and freely available data deposition and access services across the spectrum of nucleotide sequence data types. Making the world’s public sequencing datasets available to the scientific community, the ENA represents a globally comprehensive nucleotide sequence resource. Here, we outline ENA services and content in 2019 and provide an insight into select… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
111
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 106 publications
(114 citation statements)
references
References 17 publications
0
111
0
1
Order By: Relevance
“…EMBL-EBI hosts the European Nucleotide Archive [1], which has a broader scope, accepting submissions of nucleotide sequencing information, including raw sequencing data, sequence assembly information and functional annotations.…”
Section: Current Scenariomentioning
confidence: 99%
“…EMBL-EBI hosts the European Nucleotide Archive [1], which has a broader scope, accepting submissions of nucleotide sequencing information, including raw sequencing data, sequence assembly information and functional annotations.…”
Section: Current Scenariomentioning
confidence: 99%
“…Then, for each minimizer file, super-k-mers are broken into their constituent k-mers. The k-mers and their count-vectors are inserted into a hash table 1 . When a k-mer is first inserted, it has a count-vector that records the abundance of its originating super-k-mer in the corresponding dataset.…”
Section: Construction Of the Monotigsmentioning
confidence: 99%
“…When a count-vector is written to the disk, its monotig identifier given by BLight is also recorded next to it. Then, reading each file separately, we select a single representative of each vector by inserting it into an efficient dynamic hash table 1 . Once a partition is processed, we write the set of de-duplicated count vectors to disk, and record the mapping between monotig indices and their positions in the de-duplicated count-vector matrix.…”
Section: Low-memory De-duplication Of Rows In the Matrixmentioning
confidence: 99%
See 1 more Smart Citation
“…Per species, extensive literature research was performed to validate their aerobicity (Data S5). 1628 Genomes of facultative anaerobic and strict anaerobic strains from the Pseudomonas genus were obtained from the European Nucleotide Archive repository in March 2015 [27]. All genomes were de-novo annotated in SAPP [28] using Prodigal for gene prediction (version 2.6) [29], 2010] and InterProScan version 5.4-47.0 [30] for functional annotation using Pfam [31].…”
Section: Genome Annotationmentioning
confidence: 99%