Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018
DOI: 10.18653/v1/d18-1102
|View full text |Cite
|
Sign up to set email alerts
|

Decipherment of Substitution Ciphers with Neural Language Models

Abstract: Decipherment of homophonic substitution ciphers using language models (LMs) is a wellstudied task in NLP. Previous work in this topic scores short local spans of possible plaintext decipherments using n-gram LMs. The most widely used technique is the use of beam search with n-gram LMs proposed by Nuhn et al. (2013). We propose a beam search algorithm that scores the entire candidate plaintext at each step of the decipherment using a neural LM. We augment beam search with a novel rest cost estimation that explo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(20 citation statements)
references
References 10 publications
0
19
0
Order By: Relevance
“…These adjustments are informed by assumptions about ciphers used to produce the data (Knight and Yamada, 1999;Knight et al, 2006;Ravi and Knight, 2011;Pourdamghani and Knight, 2017). Besides the commonly used EM algorithm, (Nuhn et al, 2013;Hauer et al, 2014;Kambhatla et al, 2018) also tackles substitution decipherment and formulate this problem as a heuristic search procedure, with guidance provided by an external language model (LM) for candidate rescoring. So far, techniques developed for man-made ciphers have not been shown successful in deciphering archaeological data.…”
Section: Related Workmentioning
confidence: 99%
“…These adjustments are informed by assumptions about ciphers used to produce the data (Knight and Yamada, 1999;Knight et al, 2006;Ravi and Knight, 2011;Pourdamghani and Knight, 2017). Besides the commonly used EM algorithm, (Nuhn et al, 2013;Hauer et al, 2014;Kambhatla et al, 2018) also tackles substitution decipherment and formulate this problem as a heuristic search procedure, with guidance provided by an external language model (LM) for candidate rescoring. So far, techniques developed for man-made ciphers have not been shown successful in deciphering archaeological data.…”
Section: Related Workmentioning
confidence: 99%
“…They usually use EM algorithms, which are tailored towards these specific types of ciphers, most prominently substitution ciphers (Knight and Yamada, 1999;Knight et al, 2006). Later work by Nuhn et al (2013), Hauer et al (2014, and Kambhatla et al (2018) addresses the problem using a heuristic search procedure, guided by a pretrained language model. To the best of our knowledge, these methods developed for tackling man-made ciphers have so far not been successfully applied to archaeological data.…”
Section: Related Workmentioning
confidence: 99%
“…Our work exploits the embedding space learned by a neural language model, but the actual task of language modeling is otherwise irrelevant to our results. By contrast, Kambhatla et al (2018) actually sample text from a neural language model to help estimate the quality of a proposed decipherment. Future work could similarly sample from a language model as a means of counteracting the small size of the PE corpus; this should be done with caution, however, given the difficulty of evaluating whether the sampled text is fluent.…”
Section: Below)mentioning
confidence: 99%