Detecting Cover Songs with Pitch Class Key-Invariant Networks

O'Hanlon, Ken; Benetos, Emmanouil; Dixon, Simon

doi:10.1109/mlsp52302.2021.9596389

Cited by 3 publications

(2 citation statements)

References 23 publications

(45 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…• CQTNet [63]: CQTNet uses carefully designed kernels and dilated convolutions to extend the receptive field, which can improve the model's representation learning capacity. • PICKiNet [33]: PICKiNet devises pitch class blocks to obtain the key-invariant musical features.…”

Section: Comparison Baselinesmentioning

confidence: 99%

DisCover: Disentangled Music Representation Learning for Cover Song Identification

Xun

Zhang

Yang

et al. 2023

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

View full text Add to dashboard Cite

In the field of music information retrieval (MIR), cover song identification (CSI) is a challenging task that aims to identify cover versions of a query song from a massive collection. Existing works still suffer from high intra-song variances and inter-song correlations, due to the entangled nature of version-specific and version-invariant factors in their modeling. In this work, we set the goal of disentangling version-specific and version-invariant factors, which could make it easier for the model to learn invariant music representations for unseen query songs. We analyze the CSI task in a disentanglement view with the causal graph technique, and identify the intra-version and inter-version effects biasing the invariant learning. To block these effects, we propose the disentangled music representation learning framework (DisCover) for CSI. DisCover consists of two critical components: (1) Knowledge-guided Disentanglement Module (KDM) and (2) Gradient-based Adversarial Disentanglement Module (GADM), which block intra-version and inter-version biased effects, respectively. KDM minimizes the mutual information between the learned representations and version-variant factors that are identified with prior domain knowledge. GADM identifies version-variant factors by simulating the representation transitions between intra-song versions, and exploits adversarial distillation

show abstract

Section: Comparison Baselinesmentioning

confidence: 99%

DisCover: Disentangled Music Representation Learning for Cover Song Identification

Xun

Zhang

Yang

et al. 2023

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

View full text Add to dashboard Cite

show abstract

“…Byte-Cover [6] and ByteCover2 [7] achieved state-of-the-art results with a ResNet-IBN50 [8] backbone and multi-loss training with cross-entropy and triplet loss. PiCKINet [9] proposed Pitch Class Blocks in order to maintain the key-invariance features of music. LyraC-Net [10] utilized WideResNet as the backbone, and combined classification and metric learning for optimization.…”

Section: Introductionmentioning

confidence: 99%

Analysis of Social Media Messaging for Health Promotion by the CDC from 2018 to2019 in Guangzhou, China (Preprint)

Liu¹,

Ma²

2019

Preprint

View full text Add to dashboard Cite

BACKGROUND As Internet use becomes increasingly widespread, mobile devices, such as mobile phones, have become the most important channel for many people to obtain information. The number of Chinese mobile phone users has reached 1.28 billion, with more than 90% of usersaccessing social media via the mobile application WeChat. As an increasinglyubiquitous app in China, WeChat had1.04 billion monthly active users worldwide by the first quarter of 2018.[1,2]A recent national survey in China found that one-third of participants regularly read health education articles on WeChat, and 98.53% of participants choose to useWeChat for health information seeking, indicatingthatWeChat is the most popularplatform for health information acquisition in China.[3]“Everyone is the media and everyone has a microphone” in theInternet era,and the rapiddevelopment of communications technologyhas playedan increasingly important role in information dissemination. WeChat is the most popular social media platform in China, and 93% of residents in major Chinese cities are reported to log into WeChatdaily. Social media platforms, particularly WeChat, arewidely utilized by health organizations, and are fast becoming the principal instruments of alternative communication channels fordelivering health messages, conducting disease surveillance, spreading health awareness, and communicatingabout public health issues to the public.WeChat has emerged as a powerful platform with several advantages compared with more traditional communication channels, and has proven to be a cost-effective tool for the dissemination of health messages, capable of reaching minority groupsto improve public health interventions. In recent years, there has been increasing public interest in searching for online health information related to health problems. Evidence from local studies has reportedthat members of thepublic perceive information provided online to be useful and reliable, a perception that leads many peopleto consult with health care practitioners about their health conditions directly through social media. Growing online health-seeking behaviors and the increasing numbers of non-authorized health websites or social media accounts sharingbiasedor inaccurate health information involvingconflicts of interest have made it necessary for health organizations to engage with internet users on social media in a strategic way.[4]To adapt to the reading habits of the public in the Internet era, various institutions have utilized the new forms of media as important channels for government propaganda and the provision of services. WeChat is a new form of media,utilizing mobile phones as terminals, enabling instantaneous and social communication. WeChat is an important platform for audiences to obtain government information. At present, the daily search volume of medical and health problems from online has reached 60 million. A report published by the Ministry of Science Popularization of the Chinese Association of Science and Technology and the Institute of Science Popularization of China titled“Big Data of Internet Popular Science Reading 2017”reported that the popular science articles read most often by citizensin 2017 werethose focused on health and medical treatment. [3] The Guangzhou Centerfor Disease Controland Prevention(GZCDC) i-Health is anofficial WeChat account operated by the GZCDC, officially opened in April 2018. The postspublished via this account are mainly original, focusing on health tips and popular science. Because of differences in functionality between WeChat and other social media platforms, such as Facebook and Twitter, similar studies of social media are not generalizable to WeChat. Thus,specific investigations of the effectiveness of WeChat for health promotion are important.In the current study, posts published between April 2018 and April 2019 were analyzed to understand the public demand for the official CDC WeChat account, and to provide scientific evidence toinform the development of better communication strategies. OBJECTIVE To analyze the datapublished by the official Guangzhou Centerfor Disease Controland Prevention (GZCDC) i-Health WeChat account and explore the factors influencing theeffectsoninformation dissemination, soasto improve the effectiveness of health information dissemination. METHODS Data were collected from the official Guangzhou i-Health WeChat account between 1 April 2018 and 30April 2019. Descriptive analysiswas performed for the basic information regarding the WeChat account and its posts. Multiple-factor logistic regression analysis was used to explore the association between various factors of posts on engagementof followers, and the impact of the WeChat account was examined usingadjusted odds ratios (AOR) and the corresponding 95% confidence intervals (95% CI) for the variables. RESULTS Thetotal numbers of page views, shares, likes, adds to favorites, and comments for213 posts were 1147308, 84671, 10973, 5535 and 1865, respectively, from 187033followers.The engagement of followers peaked on the dissemination date and gradually declined. The main post topicswere health education posts and original posts, and 46.48% of 213posts were reposted by other official WeChat accounts.In a multiple-factor logistic regression model, the number of page viewswas found to be significantly associated withinfectious disease posts (AOR:3.20 95%CI: 1.16–8.81), original posts (AOR:10.20, 95%CI:1.17–89.28), and posts that could be judged from the title (AOR:2.93,95%CI:1.16–8.81). Vaccine-relatedevent posts (AOR:15.78, 95%CI:3.39–73.42), child and adolescent health posts (AOR:17.33, 95%CI:1.50–200.31), environmental health posts (AOR:7.94, 95%CI:1.90–33.21), chronic disease posts(AOR:4.05, 95%CI:1.18–13.45),nutrition and food-borne diseaseposts(AOR:3.91, 95%CI:1.25–12.20), infectious disease posts (AOR:3.86, 95%CI:1.36–10.98),original posts (AOR:10.22, 95%CI:1.06–98.85), posts focused on current events (AOR:3.04, 95%CI=1.36–6.76) and headline posts (AOR:5.48, 95%CI:1.14–26.41) were positively associated with being reposted by other official WeChat accounts. CONCLUSIONS Content is the most important factor in the effectiveness of official WeChat accounts at GZCDC institutions, and should be focused on health tips and popular science articles, while enhancing interactions with the public.

show abstract

Acoustics-Text Dual-Modal Joint Representation Learning for Cover Song Identification

Gu,

JingLi,

JiayiZhou

et al. 2023

2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

View full text Add to dashboard Cite

Detecting Cover Songs with Pitch Class Key-Invariant Networks

Cited by 3 publications

References 23 publications

DisCover: Disentangled Music Representation Learning for Cover Song Identification

DisCover: Disentangled Music Representation Learning for Cover Song Identification

Analysis of Social Media Messaging for Health Promotion by the CDC from 2018 to2019 in Guangzhou, China (Preprint)

Acoustics-Text Dual-Modal Joint Representation Learning for Cover Song Identification

Contact Info

Product

Resources

About