2021
DOI: 10.1109/jproc.2021.3082027

An Introduction to MPEG-G: The First Open ISO/IEC Standard for the Compression and Exchange of Genomic Sequencing Data

Abstract: The development and progress of high-throughput sequencing technologies have transformed the sequencing of DNA from a scientific research challenge to practice. With the release of the latest generation of sequencing machines, the cost of sequencing a whole human genome has dropped to less than $600. Such achievements open the door to personalized medicine, where it is expected that genomic information of patients will be analyzed as a standard practice. However, the associated costs, related to storing, trans…

Cited by 13 publications (10 citation statements)
References 41 publications
“…For its purpose, we mainly focused on ISO/IEC 23092 features [2, 3], as this is a genomic information format which has been developed using privacy by design principles since its inception. Such features, among others, are as follows: protection of information and privacy rules associated with genomic information, including metadata, a hierarchical structuring of the information, starting from the complete file and finishing on the blocks containing genomic information itself, and the possibility of associating security information with a high level of granularity.…”
Section: Discussion (mentioning)
confidence: 99%
“…We already described a preliminary version of this system in [1], but it is still under improvement, as we explain in this paper. Although the proposed system architecture is standards-agnostic, most of the modules implemented in this version are based on the ISO/IEC 23092 standard, also known as MPEG-G (Genomic Information Representation) [2, 3], as we further describe in the following sections.…”
Section: Introduction (mentioning)
confidence: 99%
“…We benchmarked Genozip 14 against two versions of CRAM: CRAM 3.0, which is the default version in samtools, and CRAM 3.1, the latest CRAM version, for which the current version of samtools (1.15.1) produces the warning “This is a technology demonstration that should not be used for archival data”. We acknowledge that other BAM compression systems exist, such as DeeZ (Hach et al., 2014) and MPEG-G (Voges et al., 2021). However, they have been shown to be significantly inferior to both CRAM 3.1 and Genozip 13 (Bonfield, 2022), therefore we did not include them in our benchmark.…”
Section: Benchmark (mentioning)
confidence: 99%