VFNet: A Convolutional Architecture for Accent Classification

Ahmed, Asad; Tangri, Pratham; Panda, Anirban; Ramani, Dhruv; Karmakar, Samarjit

doi:10.1109/indicon47234.2019.9030363

Cited by 22 publications

(17 citation statements)

References 5 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Three object detection deep learning architectures, including you only look once (YOLO) version 3 (Redmon & Farhadi, 2018), Faster R-CNN (Ren et al, 2017), and variable filter net (VFNet, Ahmed et al, 2019) were investigated for weed detection. YOLOv3 is a widely used single-stage object detector (Redmon & Farhadi, 2018).…”

Section: Object Detectionmentioning

confidence: 99%

Drought stress impact on the performance of deep convolutional neural networks for weed detection in Bahiagrass

Zhuang

Jin

Chen

et al. 2022

Grass and Forage Science

View full text Add to dashboard Cite

Machine vision‐based weed detection relies on features such as plant colour, leaf texture, shape, and patterns. Drought stress in plants can alter leaf colour and morphological features, which may in turn affect the reliability of machine vision‐based weed detection. The objective of this research was to evaluate the feasibility of using deep convolutional neural networks for the detection of Florida pusley (Richardia scabra L.) growing in drought stressed and unstressed bahiagrass (Paspalum natatum Flugge). The object detection neural networks you only look once (YOLO)v3, faster region‐based convolutional network (Faster R‐CNN), and variable filter net (VFNet) failed to effectively detect Florida pusley growing in drought stressed or unstressed bahiagrass, with F1 scores ≤0.54 in the testing dataset. Nevertheless, the use of the image classification neural networks AlexNet, GoogLeNet, and Visual Geometry Group‐Network (VGGNet) was highly effective and achieved high (≥0.97) F1 scores and recall values (≥0.98) in detecting images containing Florida pusley growing in drought stressed or unstressed bahiagrass. Overall, these results demonstrated the effectiveness of using an image classification convolutional neural network for detecting Florida pusley in drought stressed or unstressed bahiagrass. These findings illustrate the broad applicability of these neural networks for weed detection.

show abstract

Section: Object Detectionmentioning

confidence: 99%

Drought stress impact on the performance of deep convolutional neural networks for weed detection in Bahiagrass

Zhuang

Jin

Chen

et al. 2022

Grass and Forage Science

View full text Add to dashboard Cite

show abstract

“…In order to distinguish different accents in English, Teixeira et al [10] proposed to use context-dependent HMM units to optimize parallel networks and Deshpande et al [11] introduced format frequency features into GMM models. Ahmed et al [12] presented VFNet (Variable Filter Net), a convolutional neural network (CNN) based architecture which applies filters with variable size along the frequency band to capture a hierarchy of features, aiming at improving the accuracy of accent recognition in dialogues. Winata et al [13] proposed an accent-agnostic approach that extends the model-agnostic meta-learning (MAML) algorithm for fast adaptation to unseen accents.…”

Section: Related Workmentioning

confidence: 99%

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

Shi

et al. 2021

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

The variety of accents has posed a big challenge to speech recognition. The Accented English Speech Recognition Challenge (AESRC2020) is designed for providing a common testbed and promoting accent-related research. Two tracks are set in the challenge -English accent recognition (track 1) and accented English speech recognition (track 2). A set of 160 hours of accented English speech collected from 8 countries is released with labels as the training set. Another 20 hours of speech without labels is later released as the test set, including two unseen accents from another two countries used to test the model generalization ability in track 2. We also provide baseline systems for the participants. This paper first reviews the released dataset, track setups, baselines and then summarizes the challenge results and major techniques used in the submissions.

show abstract

“…In order to distinguish different accents in English, Teixeira et al [12] proposed to use context-dependent HMM units to optimize parallel networks and Deshpande et al [13] introduced format frequency features into GMM models. Ahmed et al [14] presented VFNet (Variable Filter Net), a convolutional neural network (CNN) based architecture which applies filters with variable size along the frequency band to capture a hierarchy of features, aiming at improving the accuracy of accent recognition in dialogues. Winata et al [15] proposed an accent-agnostic approach that extends the model-agnostic meta-learning (MAML) algorithm for fast adaptation to unseen accents.…”

Section: Related Workmentioning

confidence: 99%

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

Shi

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

VFNet: A Convolutional Architecture for Accent Classification

Cited by 22 publications

References 5 publications

Drought stress impact on the performance of deep convolutional neural networks for weed detection in Bahiagrass

Drought stress impact on the performance of deep convolutional neural networks for weed detection in Bahiagrass

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

Contact Info

Product

Resources

About