SottoVoce

Kimura, Naoki; Kono, Michinari; Rekimoto, Jun

doi:10.1145/3290605.3300376

Cited by 87 publications

(25 citation statements)

References 37 publications


“…Similar to VUIs, SSIs allow users to converse with computers in natural language, which provides expressive commands without requiring them to remember complicated actions or gestures. Existing SSIs are characterized by what kind of sensing methods and biosignals are used, such as tracking the movement of speech articulators using electromagnetic articulography (EMA) [13,17,53], vocal tract imaging using ultrasound imaging [22,35], capturing subtle sounds produced by non-audible murmur (NAM) [59][60][61] and ingressive speech [15], placing capacitive sensors inside the mouth [33,40], and capturing facial electrical activity using electromyography (sEMG) [31,65]. In the field of Brain-Computer Interfaces (BCI), researchers seek to decode human speech directly from the electrical activity of the brain, where the approaches can be categorized into invasive systems implanted in the cerebral cortex using electrocorticography (ECoG) [1,49] and non-invasive systems attached to the scalp using Electroencephalogram (EEG) [18,20,47].…”

Section: Silent Speech Interface (mentioning)

confidence: 99%

“…SSI provides seamless and confidential interactions in various situations, especially in those where voice interaction is inappropriate or unavailable. Recent research on SSI has proposed to use various sensing methods such as Electromyography (EMG) [31,43], ultrasound imaging [35], capacitive sensing [40] and video camera (lipreading) [34,45,58] to track the movement of speech articulators and decode silent speech.…”

Section: Introduction (mentioning)

confidence: 99%

LipLearner: Customizable Silent Speech Interactions on Mobile Devices

Su, Fang, Rekimoto. 2023. Preprint.

Figure 1: Example interaction of LipLearner. A) Voice2Lip in-situ command registration. The user records a silent speech command by vocalizing it once, then LipLearner automatically learns to lip-read it with the text recognized from the voice signal as the label. B) The command then can be used without vocalization, triggered by a silent keyword. LipLearner enables silent speech recognition which can be used in public settings (e.g., on the subway). Furthermore, it leverages incremental learning to proactively extend the model's knowledge when new samples become available.

“…The so-called silent speech interface (SSI) is a prospective application of articulatory-to-acoustic conversion methods: with it, a user who speaks silently, merely mouthing the words, can produce sound with the help of a machine (Denby et al 2010; Csapó-Grósz-Tóth-Markó 2017; Kimura et al 2019). A silent speech interface can support the communication of people who have lost the ability to produce voice as a result of an illness (for example, vocal cord surgery performed to remove a tumor) or an accident, but who can still articulate. This requires recording the speech-impaired user's silent articulation with some device (for example, a portable tongue ultrasound) and then synthesizing speech from it using machine learning (for example, on a smartphone).…”

Section: Summary (unclassified)

Tanulmányok a nyelvészet alkalmazásainak területéről

2022

Beszéd, Kutatás, Alkalmazás

“…Silent speech recognition (SSR) technology provides a solution to the aforementioned challenges because it does not rely on acoustic signals but other medium. Several signal modalities have been applied to realize SSR by capturing the movement of articulatory muscles or extracting neural information, such as the electromagnetic arthrography [5], the ultrasound or optical images of tongue or lips [6]- [8], the electromyogram (EMG) [9]- [12], and the electroencephalogram [13], [14].…”

Section: Introduction (mentioning)

confidence: 99%

Decoding Silent Speech Based on High-Density Surface Electromyogram Using Spatiotemporal Neural Network

Chen, Zhang, Chen, et al. 2023. IEEE Trans. Neural Syst. Rehabil. Eng.

Finer-grained decoding at a phoneme or syllable level is a key technology for continuous recognition of silent speech based on surface electromyogram (sEMG). This paper aims at developing a novel syllable-level decoding method for continuous silent speech recognition (SSR) using spatio-temporal end-to-end neural network. In the proposed method, the high-density sEMG (HD-sEMG) was first converted into a series of feature images, and then a spatio-temporal end-to-end neural network was applied to extract discriminative feature representations and to achieve syllable-level decoding. The effectiveness of the proposed method was verified with HD-sEMG data recorded by four pieces of 64-channel electrode arrays placed over facial and laryngeal muscles of fifteen subjects subvocalizing 33 Chinese phrases consisting of 82 syllables. The proposed method outperformed the benchmark methods by achieving the highest phrase classification accuracy (97.17 ± 1.53%, p < 0.05), and lower character error rate (3.11 ± 1.46%, p < 0.05). This study provides a promising way of decoding sEMG towards SSR, which has great potential applications in instant communication and remote control.
