ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/icassp39728.2021.9413475
Adaptable Multi-Domain Language Model for Transformer ASR

Abstract: We propose an adapter-based multi-domain Transformer language model (LM) for Transformer ASR. The model consists of a large common LM and small adapters, and it can perform multi-domain adaptation by updating only the small adapters and their related layers. The proposed model can reuse the fully fine-tuned LM, which is fine-tuned using all layers of the original model. The proposed LM can be expanded to new domains by adding about 2% of parameters for a first domain and 13% of parameters for after secon…
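
To make the adapter idea in the abstract concrete, here is a minimal sketch of a bottleneck adapter attached to a frozen Transformer layer in PyTorch. The module and parameter names (Adapter, AdaptedLayer, bottleneck_dim, the domain names) are illustrative assumptions, not the paper's actual implementation, and a generic TransformerEncoderLayer stands in for the common LM layer.

```python
# Sketch: per-domain bottleneck adapters on top of a frozen shared Transformer layer.
# All names and sizes are illustrative; the paper's exact architecture may differ.
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Small bottleneck adapter: down-project, non-linearity, up-project, residual."""
    def __init__(self, d_model: int, bottleneck_dim: int = 64):
        super().__init__()
        self.down = nn.Linear(d_model, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, d_model)
        self.act = nn.ReLU()

    def forward(self, x):
        return x + self.up(self.act(self.down(x)))  # residual keeps the shared LM path intact

class AdaptedLayer(nn.Module):
    """Wraps a (frozen) shared Transformer layer with one small adapter per domain."""
    def __init__(self, base_layer: nn.Module, d_model: int, domains):
        super().__init__()
        self.base = base_layer
        self.adapters = nn.ModuleDict({d: Adapter(d_model) for d in domains})

    def forward(self, x, domain: str):
        h = self.base(x)                  # shared computation reused across domains
        return self.adapters[domain](h)   # only this small module is domain-specific

# Freeze the common LM layer; only the adapters are trained per domain.
d_model, domains = 512, ["news", "medical"]
base = nn.TransformerEncoderLayer(d_model=d_model, nhead=8, batch_first=True)
layer = AdaptedLayer(base, d_model, domains)
for p in layer.base.parameters():
    p.requires_grad = False

x = torch.randn(2, 10, d_model)          # (batch, seq_len, d_model)
out = layer(x, domain="medical")
print(out.shape)                         # torch.Size([2, 10, 512])
```

In this setup, adding a new domain only means adding another small Adapter to the ModuleDict, which is consistent with the abstract's claim that expansion costs a few percent of extra parameters.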

Cited by 5 publications (2 citation statements). References 21 publications.
“…Similar work was performed for other languages such as Bengali and Japanese. Also, more speech corpora have been collected from young people for many languages (Zeng et al., 2020; Lee et al., 2021). However, speaker fluctuation, environmental noise, and transmission channel noise all degrade ASR performance.…”
Section: Related Work
confidence: 99%
“…Additionally, WER is reduced by 13% with an LSTM decoder (Zeng et al., 2021). Transformer encoding and decoding can be carried out with self-attention and multi-head attention layers (Lee et al., 2021). For CTC/attention-based end-to-end ASR, the Transformer model is used, which results in a WER of 23.66% (Miao et al., 2020).…”
Section: Related Work
confidence: 99%
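
For context on the self-attention and multi-head attention layers this citing work refers to, below is a minimal illustration using PyTorch's nn.MultiheadAttention with a causal mask, as in a decoder-style LM. The dimensions and variable names are illustrative assumptions, not taken from the cited papers.

```python
# Minimal illustration of multi-head self-attention as used in Transformer
# encoder/decoder layers; all sizes are illustrative.
import torch
import torch.nn as nn

d_model, nhead, seq_len, batch = 256, 4, 12, 2
attn = nn.MultiheadAttention(embed_dim=d_model, num_heads=nhead, batch_first=True)

x = torch.randn(batch, seq_len, d_model)  # token representations
# Causal mask so each position attends only to itself and earlier positions.
causal_mask = nn.Transformer.generate_square_subsequent_mask(seq_len)

# Self-attention: queries, keys, and values all come from the same sequence.
out, weights = attn(x, x, x, attn_mask=causal_mask)
print(out.shape, weights.shape)  # torch.Size([2, 12, 256]) torch.Size([2, 12, 12])
```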