ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/icassp39728.2021.9414128
ByteCover: Cover Song Identification via Multi-Loss Training

Abstract: We present in this paper ByteCover, which is a new feature learning method for cover song identification (CSI). ByteCover is built on the classical ResNet model, and two major improvements are designed to further enhance the capability of the model for CSI. In the first improvement, we introduce the integration of instance normalization (IN) and batch normalization (BN) to build IBN blocks, which are the major components of our ResNet-IBN model. With the help of the IBN blocks, our CSI model can learn featu…
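The abstract's first improvement is the IBN block. As a rough illustration, here is a minimal PyTorch sketch of the IBN idea (following Pan et al.'s IBN-Net): one half of the channels is instance-normalized, the other half batch-normalized, and the results are concatenated. The half-and-half split ratio and the affine IN are common defaults assumed here, not details confirmed by the ByteCover paper.

```python
import torch
import torch.nn as nn

class IBN(nn.Module):
    """Instance-Batch Normalization block: IN on part of the channels
    (invariance to style/timbre-like variation), BN on the rest
    (preserving discriminative statistics)."""

    def __init__(self, channels: int, ratio: float = 0.5):
        super().__init__()
        self.half = int(channels * ratio)  # channels routed to instance norm (assumed split)
        self.instance_norm = nn.InstanceNorm2d(self.half, affine=True)
        self.batch_norm = nn.BatchNorm2d(channels - self.half)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Split along the channel axis, normalize each part, and re-join.
        a, b = torch.split(x, [self.half, x.size(1) - self.half], dim=1)
        return torch.cat([self.instance_norm(a), self.batch_norm(b)], dim=1)
```

In IBN-Net-style models, a block like this typically replaces the first BN layer inside early residual blocks of the ResNet backbone.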

Cited by 17 publications (14 citation statements) | References 13 publications
“…In the self-attention layer, we use a random mask to erase some values of the embedding to further improve robustness. As a result, the introduced time-pooling module leverages feature statistics and pays more attention to the discriminative frames, which leads to a clear improvement over the widely adopted generalized mean pooling (GeM) of recent approaches [6], [7], [10]. More ablation studies are shown in Section III-D.…”
Section: B. Time-Domain Pooling Module
confidence: 99%
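For context, the GeM baseline this statement compares against can be sketched in a few lines of PyTorch. The learnable exponent p (initialized to 3) and the clamp epsilon are common defaults assumed here, not values taken from [6], [7], or [10].

```python
import torch
import torch.nn as nn

class GeMPooling1d(nn.Module):
    """Generalized mean (GeM) pooling over the time axis: each channel is
    raised to a learnable power p, averaged over time, then the p-th root
    is taken. p -> 1 recovers average pooling; large p approaches max."""

    def __init__(self, p: float = 3.0, eps: float = 1e-6):
        super().__init__()
        self.p = nn.Parameter(torch.tensor(p))  # learnable pooling exponent
        self.eps = eps                          # avoids pow() on zeros

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, time) -> (batch, channels)
        x = x.clamp(min=self.eps).pow(self.p)
        return x.mean(dim=-1).pow(1.0 / self.p)
```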
“…We use the embedding before the bottleneck layer to compute the contrastive loss (i.e., triplet loss [14]) and the embedding after the bottleneck layer to compute the focal loss [15] and center loss [16]. Note that we use the focal loss in place of the traditional cross-entropy loss of [6], which improves performance in the face of data imbalance. The center loss [16] helps training convergence and achieves higher performance.…”
Section: Loss
confidence: 99%
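The quoted setup combines three objectives across the two embeddings. Below is a hedged PyTorch sketch of one way such a multi-loss combination can be wired up; the triplet margin, focal gamma, and loss weights are illustrative assumptions, not values from the cited paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def focal_loss(logits: torch.Tensor, labels: torch.Tensor,
               gamma: float = 2.0) -> torch.Tensor:
    """Focal loss [15]: cross-entropy scaled by (1 - p_t)^gamma, so easy,
    well-classified examples contribute less under class imbalance."""
    log_p = F.log_softmax(logits, dim=-1)
    log_pt = log_p.gather(1, labels.unsqueeze(1)).squeeze(1)
    return ((1.0 - log_pt.exp()) ** gamma * -log_pt).mean()

class CenterLoss(nn.Module):
    """Center loss [16]: squared distance of each embedding to a learnable
    per-class center, pulling same-class embeddings together."""

    def __init__(self, num_classes: int, dim: int):
        super().__init__()
        self.centers = nn.Parameter(torch.randn(num_classes, dim))

    def forward(self, emb: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
        return ((emb - self.centers[labels]) ** 2).sum(dim=1).mean()

triplet = nn.TripletMarginLoss(margin=0.3)  # margin is an assumption

def multi_loss(anchor, positive, negative,  # pre-bottleneck embedding triplets
               post_emb, logits, labels,    # post-bottleneck embedding & logits
               center_loss: CenterLoss,
               w_tri: float = 1.0, w_foc: float = 1.0, w_cen: float = 5e-4):
    """Combine the three objectives as in the quoted description:
    triplet on the pre-bottleneck embedding, focal + center on the
    post-bottleneck side. Weights are illustrative only."""
    return (w_tri * triplet(anchor, positive, negative)
            + w_foc * focal_loss(logits, labels)
            + w_cen * center_loss(post_emb, labels))
```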