Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer 2021
DOI: 10.18653/v1/2021.acl-long.561
|View full text |Cite
|
Sign up to set email alerts
|

Can Sequence-to-Sequence Models Crack Substitution Ciphers?

Abstract: Decipherment of historical ciphers is a challenging problem. The language of the target plaintext might be unknown, and ciphertext can have a lot of noise. State-of-the-art decipherment methods use beam search and a neural language model to score candidate plaintext hypotheses for a given cipher, assuming the plaintext language is known. We propose an end-to-end multilingual model for solving simple substitution ciphers. We test our model on synthetic and real historical ciphers and show that our proposed meth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(8 citation statements)
references
References 15 publications
0
8
0
Order By: Relevance
“…Seq2Seq Following (Aldarrab and May, 2021), this is a character level Transformer architecture that is only optimized on the target-side (plaintext) loss: Target-Only CausalLM is only optimized on the target-side loss L T GT , and incurs no loss when generating the source text.…”
Section: Modelling Symbol Recurrence Relationsmentioning
confidence: 99%
See 4 more Smart Citations
“…Seq2Seq Following (Aldarrab and May, 2021), this is a character level Transformer architecture that is only optimized on the target-side (plaintext) loss: Target-Only CausalLM is only optimized on the target-side loss L T GT , and incurs no loss when generating the source text.…”
Section: Modelling Symbol Recurrence Relationsmentioning
confidence: 99%
“…For simple substitution ciphers, we use the same English data as above to create 1.2M synthetic substitution ciphers with lengths up to 256. Following previous work on 1:1 ciphers (Nuhn et al, 2013;Aldarrab and May, 2021), we evaluate on 50 test ciphers of lengths up to 128 (16,32,64) and beyond 128 (128,256) from the Wikipedia page on History 5 . All our experimental settings include data with word boundaries denoted by the space symbol (_).…”
Section: Modelling Symbol Recurrence Relationsmentioning
confidence: 99%
See 3 more Smart Citations