2019 Data Compression Conference (DCC) 2019
DOI: 10.1109/dcc.2019.00062
|View full text |Cite
|
Sign up to set email alerts
|

Constructing Antidictionaries in Output-Sensitive Space

Abstract: A word x that is absent from a word y is called minimal if all its proper factors occur in y. Given a collection of k words y 1 , y 2 , . . . , y k over an alphabet Σ, we are asked to compute the set M y1#...#y k of minimal absent words of length at most of word y = y 1 #y 2 # . . . #y k , # / ∈ Σ. In data compression, this corresponds to computing the antidictionary of k documents. In bioinformatics, it corresponds to computing words that are absent from a genome of k chromosomes. This computation generally r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(18 citation statements)
references
References 24 publications
0
18
0
Order By: Relevance
“…A preliminary version of this paper appeared as [1]. Compared to the preliminary version, we have extended the work by adding a simplified space-efficient version of the algorithm (see Section 4).…”
Section: N=1mentioning
confidence: 99%
See 3 more Smart Citations
“…A preliminary version of this paper appeared as [1]. Compared to the preliminary version, we have extended the work by adding a simplified space-efficient version of the algorithm (see Section 4).…”
Section: N=1mentioning
confidence: 99%
“…The set of all words over Σ of length at most is denoted by Σ ≤ . We fix a constant-sized alphabet Σ, i.e., |Σ| = O (1). Given a word y = uxv over Σ, we say that u is a prefix of y, x is a factor (or subword) of y, and v is a suffix of y.…”
Section: Preliminariesmentioning
confidence: 99%
See 2 more Smart Citations
“…Motivated by these two concepts and the role they play, we study in this paper the set of absent subsequences of a string w, i.e., the set of strings which are not subsequences of w. As such, our investigation is also strongly related to the study of missing factors (or missing words, MAWs) in strings, where the focus is on the set of strings which are not substrings (or factors) of w. The literature on the respective topic ranges from many very practical applications of this concept [5,13,14,20,47,53] to deep theoretical results of combinatorial [10,19,22,23,24,44,43] or algorithmic nature [1,2,5,6,15,17,27]. Absent subsequences are also related to the well-studied notion of patterns avoided by permutations, see for instance [35], with the main difference being that a permutation is essentially a word whose letters are pairwise distinct.…”
Section: Introductionmentioning
confidence: 99%