2018
DOI: 10.5281/zenodo.1342401

Jakobovski/Free-Spoken-Digit-Dataset: V1.0.8

citations
Cited by 15 publications
(4 citation statements)
references
References 0 publications
“…The convolutional network was implemented in Keras ( Chollet, 2018 ) with Tensorflow ( Abadi et al, 2016 ) back-end. All main results were confirmed by analyzing a standard speech data set—the so called Jakobovski free spoken digit data set (FSDD) ( Jackson et al, 2018 ), containing spoken numbers from 0 to 9 in English language in accordance to the MNIST data set with written digits in this range ( LeCun et al, 1998 ). This was done using a completely new code base exclusively build of KERAS layers.…”
Section: Methods (mentioning)
confidence: 76%
“…The second used data set is an open data set consisting of spoken digits (0–9)–in analogy to the MNIST data set– in English. The data set is sampled with 8 kHz and consists of 2,000 recorded digits from four speakers ( Jackson et al, 2018 ). Here the first five repetitions of for each speaker and each digit are used as test data, the respective remaining 45 repetitions serve as training data.…”
Section: Methods (mentioning)
confidence: 99%
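The split described in the statement above (first five repetitions per speaker and digit for testing, the remaining 45 for training) can be sketched as follows. This is a minimal illustration, not the citing authors' code: it assumes file names of the form `<digit>_<speaker>_<index>.wav` with a zero-based repetition index, and the speaker names and the `split_fsdd` helper are placeholders invented for the example.

```python
from pathlib import PurePath

# Assumption: "first five repetitions" means repetition indices 0..4.
TEST_REPETITIONS = 5


def split_fsdd(filenames):
    """Split recordings into (train, test) by repetition index.

    Assumes names like '7_spk1_32.wav': digit, speaker, repetition index.
    """
    train, test = [], []
    for name in filenames:
        # The stem drops the '.wav' suffix; the last field is the repetition index.
        _digit, _speaker, index = PurePath(name).stem.split("_")
        if int(index) < TEST_REPETITIONS:
            test.append(name)
        else:
            train.append(name)
    return train, test


# Synthetic file names standing in for the real recordings:
# 4 placeholder speakers x 10 digits x 50 repetitions = 2,000 files.
speakers = ["spk1", "spk2", "spk3", "spk4"]
names = [
    f"{digit}_{speaker}_{rep}.wav"
    for speaker in speakers
    for digit in range(10)
    for rep in range(50)
]

train, test = split_fsdd(names)
print(len(train), len(test))  # 1800 200
```

Under these assumptions the 2,000 recordings divide into 1,800 training and 200 test files, which matches the 45/5 split per speaker and digit quoted above.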
“…As such, we can use the C-LSTM architecture to conduct a controlled inquiry into our research question. and the Free Spoken Digit dataset (Jackson et al, 2018). We selected these datasets because of their tractability.…”
Section: Multimodal Convolutional LSTM Model (mentioning)
confidence: 99%