A Query-by-Singing System for Retrieving Karaoke Music

Yu, Hung-Ming; Tsai, Wei-Ho; Wang, Hsin‐Min

doi:10.1109/tmm.2008.2007345

Cited by 37 publications

(18 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The query may be transposed by, e.g., all possible numbers of semitones within the octave (Yu et al, 2008) or from −5 to +5 semitones in half-of-the-semitone steps (Jang et al, 2011). Various numbers of repetitions may be considered but in any way this is clearly a brute-force approach which increases the computational complexity significantly.…”

Section: Melody Matchingmentioning

confidence: 99%

“…In general, the proposed tune follower and its adaptive variant enable to efficiently refine the results without computationally complex methods such as repeating the DTW for all possible transpositions (Yu et al, 2008). It should be noted that they can be used independently from efficient indexing techniques (Zhu, Shasha, 2003; Keogh, 2002) or note-based approximate algorithms to increase the speed and reliability of a QBSH-based search engine.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Follow That Tune-Adaptive Approach to DTW-based Query-by-Humming System

Stasiak¹

2015

Archives of Acoustics

View full text Add to dashboard Cite

Dynamic Time Warping is a standard algorithm used for matching time series irrespective of local tempo variations. Its application in the context of Query-by-Humming interface to multimedia databases requires providing the transposition independence, which involves some additional, sometimes computationally expensive processing and may not guarantee the success, e.g., in the presence of a pitch trend or accidental key changes.The method of tune following, proposed in this paper, enables solving the pitch alignment problem in an adaptive way inspired by the human ability of ignoring typical errors occurring in sung melodies. The experimental validation performed on the database containing 4431 queries and over 5000 templates confirmed the enhancement introduced by the proposed algorithm in terms of the global recognition rate.

show abstract

Section: Melody Matchingmentioning

confidence: 99%

mentioning

confidence: 99%

Follow That Tune-Adaptive Approach to DTW-based Query-by-Humming System

Stasiak¹

2015

Archives of Acoustics

View full text Add to dashboard Cite

show abstract

“…Common input query types in MIR systems are example [7][8][9][10], singing [11,12] and humming [11,13]. The popularity of multimedia devices (e.g.…”

Section: Introductionmentioning

confidence: 99%

A Survey on Query-by-Example based Music Information Retrieval

Borjian¹

2017

IJCA

View full text Add to dashboard Cite

ABSTRACTsearch in huge musical datasets using a query provided as a fragment of desired song while there exists no extra information is a particular concern in content-based music information retrieval (MIR), defined as query-by-example (QBE). A number of QBE based MIR systems have evolved in recent years, which search a desired song without any manual of its originality, such as title, composer, singer or etc., and return a list of songs ranked in descending order according to the similarity with the given query recorded by user on TV, in gym or so on. Although, too much attention has been paid to this topic by researchers and developers in several communities, such as information retrieval, data mining or multimedia browsing engines, but it still suffers from no existing a unique definition on structure, aim, similarity, performance and also output results. This paper focuses on providing a brief overview of available QBE based MIR systems to manifest variety, opportunities and challenges in this area.

show abstract

“…The comprehensive online music discography Discogs.com lists over 200,000 releases containing an instrumental mix but only about 40,000 which include an a cappella mix. The availability of these separated mixes are crucial in the creation and performance of some genres of music [3,4,5]. These instrumental and a cappella versions can also be used as ground-truth for vocal removal or isolation algorithms [6].…”

Section: Introductionmentioning

confidence: 99%

“…A simple approach is proposed in [5] where an optimally shifted and scaled instrumental mix is subtracted from the complete mix in the time or frequency domain in attempt to obtain a (previously unavailable) a cappella mix. However, this approach does not cover the more general case where different mixes may be extracted from different media (e.g.…”

Section: Introductionmentioning

confidence: 99%

Estimating timing and channel distortion across related signals

Raffel

Ellis

2014

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

We consider the situation where there are multiple audio signals whose relationship is of interest. If these signals have been differently captured, the otherwise similar signals may be distorted by fixed filtering and/or unsynchronized timebases. Examples include recordings of signals before and after radio transmission and different versions of musical mixes obtained from CDs and vinyl LPs. We present techniques for estimating and correcting timing and channel differences across related signals. Our approach is evaluated in the context of artificially manipulated speech utterances and two source separation tasks.

show abstract

A Query-by-Singing System for Retrieving Karaoke Music

Cited by 37 publications

References 25 publications

Follow That Tune-Adaptive Approach to DTW-based Query-by-Humming System

Follow That Tune-Adaptive Approach to DTW-based Query-by-Humming System

A Survey on Query-by-Example based Music Information Retrieval

Estimating timing and channel distortion across related signals

Contact Info

Product

Resources

About