2020
DOI: 10.1007/978-3-030-49443-8_5
Black-Box Attacks via the Speech Interface Using Linguistically Crafted Input

Abstract: This paper presents the results of experiments demonstrating novel black-box attacks via the speech interface. We demonstrate two types of attack that use linguistically crafted adversarial input to target vulnerabilities in the handling of speech input by a speech interface. The first attack uses nonsensical word sounds to gain covert access to voice-controlled systems, exploiting vulnerabilities at the speech recognition stage of speech input handling. The second attack demons…

Cited by 1 publication (2 citation statements)
References 22 publications
“…They present the results of experimental work showing that it is possible to mislead natural language understanding in third-party applications for Amazon Alexa (known as Skills), either by replacing words in target commands or by embedding homophones of target command words in a different sense context, so as to create apparently unrelated utterances that the system accepts as the target command. In an extended version of the original paper, published in this volume, the authors demonstrate further instances of the latter type of attack on Amazon Alexa Skills as well as on the open-source natural language understanding technology RASA NLU (Bispham et al. [9]). This type of attack, based on embedding homophones of target command words in a different sense context, is termed a 'word transplant' attack by the authors.…”
Section: Prior Work on the Security of the Speech Interface
confidence: 97%
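The 'word transplant' construction quoted above can be made concrete with a toy sketch. The following Python fragment is our own illustration, not code from the cited work; the homophone pairs and the carrier sentence are invented for the example:

```python
# Toy illustration (our sketch, not code from the cited work): building a
# 'word transplant' utterance by embedding homophones of target command
# words in a different sense context. The homophone pairs and carrier
# sentence below are invented examples.

# Hypothetical sound-alike pairs: command word -> homophone in another sense.
HOMOPHONES = {
    "buy": "by",    # "buy" (purchase) vs "by" (preposition)
    "four": "for",  # "four" (number) vs "for" (preposition)
}

def word_transplant(target_command: str, carrier_template: str) -> str:
    """Swap target command words for homophones and embed the result in a
    carrier sentence, so a human hears an apparently unrelated utterance
    while the command's word sounds survive in sequence."""
    payload = " ".join(HOMOPHONES.get(w, w) for w in target_command.split())
    return carrier_template.format(payload=payload)

# Hidden target command: "buy four tickets".
print(word_transplant("buy four tickets", "we strolled {payload} to the fair"))
# -> "we strolled by for tickets to the fair"
```

A human listener parses 'by' and 'for' in their prepositional sense, while keyword-driven intent matching in an NLU component may still resolve the hidden purchase command.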
“…The potential of such an approach to defend against attacks on natural language understanding in voice-controlled systems can be readily demonstrated, using the freely available machine translation service Google Translate, in the context of the 'word transplant' attacks on natural language understanding in Alexa Skills and RASA NLU via homophones of target command words demonstrated by Bispham et al. [9]. Google Translate uses RNNs for sequence-to-sequence mapping of input in one language to output in another (see Wu et al. [54]). Table 2 shows the successful adversarial commands used in the word transplant attacks on natural language understanding in Alexa Skills and RASA NLU demonstrated by Bispham et al., together with their translation by Google Translate into German.…”
Section: Defences Against Attacks on Natural Language Understanding
confidence: 99%
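The detection logic implied by this defence can be sketched as follows. This is a minimal illustration under our own assumptions (the quoted work does not publish code): translate() and classify_intent() are hypothetical placeholders standing in for a machine translation service such as Google Translate and for the voice-controlled system's NLU component, respectively:

```python
# Minimal sketch of the translation-based defence idea (our assumption,
# not code from the cited work). translate() and classify_intent() are
# hypothetical placeholders: translate() stands in for any machine
# translation service such as Google Translate, and classify_intent()
# for the voice-controlled system's NLU component.

def translate(text: str, src: str, dest: str) -> str:
    """Placeholder for a machine translation call."""
    raise NotImplementedError

def classify_intent(text: str) -> str:
    """Placeholder for the NLU intent classifier being protected."""
    raise NotImplementedError

def is_suspected_word_transplant(utterance: str) -> bool:
    """Flag a possible 'word transplant' attack.

    A homophone embedded in a different sense context is translated
    according to its surface sense, so the hidden command tends not to
    survive a round trip through another language. An intent mismatch
    between the original utterance and its round-tripped form is
    therefore treated as suspicious."""
    round_trip = translate(translate(utterance, src="en", dest="de"),
                           src="de", dest="en")
    return classify_intent(utterance) != classify_intent(round_trip)
```

German is chosen as the pivot language here only to mirror the Table 2 example in the quoted text; any language pair with adequate translation quality would serve the same purpose.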