2017
DOI: 10.1142/s0129054117400093

Computing the Expected Edit Distance from a String to a Probabilistic Finite-State Automaton

Abstract: In a number of fields, it is necessary to compare a witness string with a distribution. One possibility is to compute the probability of the string for that distribution. Another, giving a more global view, is to compute the expected edit distance from a string randomly drawn to the witness string. This number is often used to measure the performance of a prediction, the goal then being to return the median string, or the string wi…
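To make the quantity in the abstract concrete, the following minimal sketch computes the expected edit distance from a randomly drawn string to a witness string. It assumes the distribution is given as an explicit finite dictionary of strings and probabilities and uses unit-cost Levenshtein distance; it is not the paper's algorithm, which works on a probabilistic finite-state automaton. The names `edit_distance`, `expected_edit_distance`, and `toy` are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Unit-cost Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or match at cost 0
            ))
        prev = curr
    return prev[-1]


def expected_edit_distance(witness: str, distribution: dict[str, float]) -> float:
    """Sum of p(s) * d(s, witness) over an explicit finite distribution (assumption)."""
    return sum(p * edit_distance(s, witness) for s, p in distribution.items())


# Toy distribution over three strings; the probabilities sum to 1.
toy = {"ab": 0.5, "abb": 0.25, "b": 0.25}
print(expected_edit_distance("ab", toy))
```

Enumerating the support only works when it is finite and small; a distribution defined by an automaton generally has infinite support, which is why a dedicated algorithm is needed there.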

Cited by 3 publications (7 citation statements)
References 21 publications
“…In this adaptation, the prototype generation stage of the RHC algorithm required the computation of the median value of a set of strings. Given that this computation is an NP-hard problem (Calvo-Zaragoza et al 2017a), the median computation was tackled by considering a set-median strategy instead.…”
Section: Background In Data Reduction (mentioning)
confidence: 99%
“…In spite of its conceptual simplicity, the computation of the median value in the string domain still constitutes an open research question owing to the fact that it is an NP-complete problem (Calvo-Zaragoza et al 2017a). This signifies that while works such as that of Kruskal (1983) propose strategies for the exact median calculation of this median value in the string domain, its applicability is severely conditioned by its extremely low efficiency.…”
Section: Background In Data Reduction (mentioning)
confidence: 99%
“…As aforementioned, this algorithm replaces same-class subsets of prototypes by new elements generated by estimating their median value. Thus, the main issue to tackle is the actual retrieval of the median value of a group of strings, which in our case we resort to the set median as the calculus of the exact median string constitutes an NP-hard problem [18]. Additionally, in order to compare the performance of RHC strategy in both statistical and structural spaces, we make use of the Dissimilarity Space (DS) technique [19] to map the initial strings representation onto a feature-based codification so that additional conclusions can be gathered.…”
Section: Introduction (mentioning)
confidence: 99%
“…The main consideration in this design is the actual computation of the median string. As it has been introduced, the retrieval of the exact median value of a set of strings is known to be a NP-hard problem [18]. Thus, in this case we consider the set-median operation due to its lower complexity.…”
mentioning
confidence: 99%
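
The set-median strategy mentioned in the statements above can be sketched as follows. This is a minimal illustration, assuming unit-cost Levenshtein distance and a small in-memory list of strings; it is not taken from the citing papers, and the names `edit_distance` and `set_median` are hypothetical. The set median restricts the search to the input strings themselves, so it needs only pairwise distances, whereas the exact median string may be any string over the alphabet.

```python
def edit_distance(a: str, b: str) -> int:
    """Unit-cost Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or match at cost 0
            ))
        prev = curr
    return prev[-1]


def set_median(strings: list[str]) -> str:
    """Return the input string with the smallest summed distance to all inputs.

    Ties are resolved in favour of the earliest candidate, as min() does.
    """
    return min(strings, key=lambda c: sum(edit_distance(c, s) for s in strings))


words = ["cat", "cart", "car", "bat"]
print(set_median(words))
```

For n strings this performs on the order of n² distance computations, each quadratic in the string lengths, which is the lower-complexity trade-off the statements refer to.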