2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854953

Improved musical onset detection with Convolutional Neural Networks

citations
Cited by 171 publications
(164 citation statements)
references
References 7 publications
“…fully connected) layer with one output neuron and tanh (hyperbolic tangent) activation function. The neural architecture basically follows the scheme proposed by Schlüter and Böck in [27] with some modifications concerning - apart from the aforementioned regression, replacing classification - mostly the type of nonlinearity of the layers. We agree with [27] that the rectified linear units in the first convolutional layer may play the role of the half-wave rectifier H function (cf.…”
Section: A Neural Network Architecture
Type: mentioning
confidence: 99%
“…For the tanh activation function the optimal threshold value T_opt determined in our tests, i.e. the value maximizing the F-measure [27][10] was always lower than zero. After the thresholding, the peak-picking procedure is applied and peaks found within the range of 50 ms relative to the actual onsets are treated as the properly detected ones.…”
Section: Fig 1 Spectrogram Fragment Enlarged (In a Black Box Lower
Type: mentioning
confidence: 99%
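The thresholding, peak-picking, and tolerance-window matching described in the statement above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the citing authors' code: the function names `pick_peaks` and `f_measure`, the 100 frames-per-second rate, the threshold of -0.2, and the greedy one-to-one matching are all hypothetical choices, and "within the range of 50 ms" is read here as a tolerance of plus or minus 50 ms.

```python
def pick_peaks(activation, threshold, frame_rate):
    """Return onset times (seconds) at local maxima of `activation`
    that exceed `threshold`. `frame_rate` is in frames per second."""
    onsets = []
    for i in range(1, len(activation) - 1):
        is_peak = activation[i - 1] < activation[i] >= activation[i + 1]
        if is_peak and activation[i] > threshold:
            onsets.append(i / frame_rate)
    return onsets


def f_measure(detected, reference, tolerance=0.05):
    """F-measure with greedy one-to-one matching: a detection counts as
    correct if an unmatched reference onset lies within `tolerance` seconds."""
    unmatched = sorted(reference)
    true_pos = 0
    for t in sorted(detected):
        match = next((r for r in unmatched if abs(r - t) <= tolerance), None)
        if match is not None:
            unmatched.remove(match)  # each reference onset is matched once
            true_pos += 1
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(detected)
    recall = true_pos / (true_pos + len(unmatched))
    return 2 * precision * recall / (precision + recall)


# Toy tanh-range activations (values in [-1, 1]); assumed 100 frames/s.
activation = [-0.9, -0.8, -0.1, -0.7, -0.9, -0.95, 0.6, -0.5, -0.9, -0.3, -0.9]
detected = pick_peaks(activation, threshold=-0.2, frame_rate=100)
score = f_measure(detected, reference=[0.02, 0.5])
```

With a tanh output the activations lie in [-1, 1], so a negative threshold such as the one assumed here is a legitimate operating point; in practice the threshold would be swept and the value maximizing the F-measure kept.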
“…For instance, a multi-net approach proposed by Lacoste and Eck (2007) is based on merging the results obtained from several networks, each trained with a different set of hyper-parameters, by means of an additional "output" neural network followed by a peak-picking procedure. Apart from the standard questions regarding the number of hidden layers and hidden neurons, several different NN types, including the recurrent neural network (RNN), the feed-forward convolutional neural network (CNN) and the LSTM (long short-term memory) neural network, have been considered (Böck et al., 2012; Eyben et al., 2010; Schlüter and Böck, 2014).…”
Section: 3
Type: mentioning
confidence: 99%