While end-to-end speech recognition models show impressive performance in many domains, they have difficulty decoding long-form utterances. The overlapped inference algorithm, which breaks ties between two parallel hypotheses, has been proposed for long-form speech recognition and yields dramatic performance improvements at the cost of doubled computation. In this paper, we propose a more effective form of overlapped inference that aligns partially matched hypotheses. In experiments on the LibriSpeech dataset, the proposed algorithm improved performance at lower computational cost than conventional overlapped inference.
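As a rough illustration of the idea, the sketch below decodes a long utterance in overlapping chunks and stitches the chunk hypotheses together by finding the longest word-level suffix/prefix match in the overlap region. It assumes a chunk-level decoder `decode_chunk` and 16 kHz audio; the matching heuristic is a simple stand-in for the paper's partial-hypothesis alignment, not the authors' exact algorithm.

```python
from typing import Callable, List

def overlapped_inference(audio: List[float],
                         decode_chunk: Callable[[List[float]], List[str]],
                         chunk_len: int = 30 * 16000,
                         overlap: int = 10 * 16000) -> List[str]:
    """Decode overlapping chunks; merge by aligning partially matched words."""
    hop = chunk_len - overlap
    merged: List[str] = []
    for start in range(0, max(len(audio) - overlap, 1), hop):
        words = decode_chunk(audio[start:start + chunk_len])
        if merged:
            # Align the new hypothesis against the tail of the merged output:
            # keep the longest suffix of the tail that matches a prefix of
            # the new chunk, then append only the unmatched remainder.
            tail = merged[-30:]  # only recent words can fall in the overlap
            join = 0
            for k in range(min(len(tail), len(words)), 0, -1):
                if tail[-k:] == words[:k]:
                    join = k
                    break
            words = words[join:]
        merged.extend(words)
    return merged
```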
We propose an adapter-based multi-domain Transformer language model (LM) for Transformer ASR. The model consists of a large common LM and small adapters, and it performs multi-domain adaptation by updating only the small adapters and their related layers. The proposed model can also reuse a fully fine-tuned LM, one fine-tuned across all layers of the original model. It can be expanded to new domains by adding about 2% of the parameters for the first domain and about 13% for each subsequent domain. The proposed model further reduces maintenance cost, because the costly and time-consuming common-LM pre-training process can be omitted. Using the proposed adapter-based approach, we observed that a general LM with an adapter can outperform a dedicated music-domain LM in terms of word error rate (WER).
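To make the architecture concrete, here is a minimal PyTorch sketch of the adapter idea: a large common LM that stays frozen and shared across domains, plus a small per-domain bottleneck adapter. The bottleneck shape follows the common adapter design (Houlsby et al., 2019); the dimensions, the single output-side adapter placement, and all names are illustrative assumptions, not the paper's exact configuration. In practice adapters typically sit inside every Transformer layer rather than only at the output; a single adapter just keeps the sketch short.

```python
import torch
import torch.nn as nn

class DomainAdapter(nn.Module):
    """Small bottleneck adapter with a residual connection."""
    def __init__(self, d_model: int = 512, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(d_model, bottleneck)  # project down
        self.up = nn.Linear(bottleneck, d_model)    # project back up

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(torch.relu(self.down(x)))

# The large common LM is frozen and shared across domains; only the small
# adapters (and any layers tied to them) are trained per domain.
common_lm = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=512, nhead=8, batch_first=True),
    num_layers=6)
for p in common_lm.parameters():
    p.requires_grad = False

adapters = nn.ModuleDict({"music": DomainAdapter(), "news": DomainAdapter()})

def forward_with_adapter(embeddings: torch.Tensor, domain: str) -> torch.Tensor:
    """embeddings: (batch, seq_len, d_model) token embeddings."""
    hidden = common_lm(embeddings)   # shared representation
    return adapters[domain](hidden)  # cheap domain-specific transform
```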