2019 IEEE 16th India Council International Conference (INDICON) 2019
DOI: 10.1109/indicon47234.2019.9030363
|View full text |Cite
|
Sign up to set email alerts
|

VFNet: A Convolutional Architecture for Accent Classification

Abstract: Understanding accent is an issue which can derail any human-machine interaction. Accent classification makes this task easier by identifying the accent being spoken by a person so that the correct words being spoken can be identified by further processing, since same noises can mean entirely different words in different accents of the same language. In this paper, we present VFNet (Variable Filter Net), a convolutional neural network (CNN) based architecture which captures a hierarchy of features to beat the p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0
2

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(17 citation statements)
references
References 5 publications
0
15
0
2
Order By: Relevance
“…Three object detection deep learning architectures, including you only look once (YOLO) version 3 (Redmon & Farhadi, 2018), Faster R-CNN (Ren et al, 2017), and variable filter net (VFNet, Ahmed et al, 2019) were investigated for weed detection. YOLOv3 is a widely used single-stage object detector (Redmon & Farhadi, 2018).…”
Section: Object Detectionmentioning
confidence: 99%
“…Three object detection deep learning architectures, including you only look once (YOLO) version 3 (Redmon & Farhadi, 2018), Faster R-CNN (Ren et al, 2017), and variable filter net (VFNet, Ahmed et al, 2019) were investigated for weed detection. YOLOv3 is a widely used single-stage object detector (Redmon & Farhadi, 2018).…”
Section: Object Detectionmentioning
confidence: 99%
“…In order to distinguish different accents in English, Teixeira et al [10] proposed to use context-dependent HMM units to optimize parallel networks and Deshpande et al [11] introduced format frequency features into GMM models. Ahmed et al [12] presented VFNet (Variable Filter Net), a convolutional neural network (CNN) based architecture which applies filters with variable size along the frequency band to capture a hierarchy of features, aiming at improving the accuracy of accent recognition in dialogues. Winata et al [13] proposed an accent-agnostic approach that extends the model-agnostic meta-learning (MAML) algorithm for fast adaptation to unseen accents.…”
Section: Related Workmentioning
confidence: 99%
“…In order to distinguish different accents in English, Teixeira et al [12] proposed to use context-dependent HMM units to optimize parallel networks and Deshpande et al [13] introduced format frequency features into GMM models. Ahmed et al [14] presented VFNet (Variable Filter Net), a convolutional neural network (CNN) based architecture which applies filters with variable size along the frequency band to capture a hierarchy of features, aiming at improving the accuracy of accent recognition in dialogues. Winata et al [15] proposed an accent-agnostic approach that extends the model-agnostic meta-learning (MAML) algorithm for fast adaptation to unseen accents.…”
Section: Related Workmentioning
confidence: 99%