“…To cope with this problem, most previous works have tried to automatically associate sounds with words for queryby-text retrieval or music annotation [4,5,6,7,8,9,10,11]. Recently, inference techniques based on topic models, such as probabilistic latent semantic analysis (pLSA) and latent Dirichlet allocation (LDA), have been exploited for automatic image annotation and retrieval [12,13].…”