Research communications often introduce biases or misrepresentations without providing reliable links to the research they use so readers can check the veracity of the claims being made. We tested the feasibility of a tool that can be used to automatically recommend research articles to research communications. From 207,538 vaccination-related PubMed articles, we selected 3,573 unique links to webpages using Altmetric. We tested a method for ranking research articles relative to each webpage using a canonical correlation analysis (CCA) approach. Outcome measures were the median rank of the correct source article; the percentage of webpages for which the source article was correctly ranked first; and the percentage ranked within the top 50 candidate articles. The best of the baseline approaches ranked the matching source article first for more a quarter of webpages; and within the top 50 for more than half. Augmenting baseline methods with CCA improved results but failed when added to some of the baseline approaches. The best CCA-based approach ranked the matching source articles first for 14%; and in the top 50 for 38%. Tools to help people identify source articles for vaccination-related research communications are potentially feasible and may support the prevention of bias and misrepresentation of research in news and social media.Keywords research communications · news media · information retrieval · vaccination
BackgroundThe communication of health and medical research online provides a critical resource for the public. More than three-quarters of the UK public report an interest in biomedical research, with 42% having actively sought out content relating to medical or health research in 2015 [17]. Nearly all searches for health information take place online via search engines [3,9,11,17], making Internet searches a common way for people to engage with medical research and associated media communications, and have the potential to alter their healthcare beliefs and decisions [35].Research communications in news and social media face several challenges. Studies with fewer participants and of lower methodological rigour are more common in news media [13,32], and research from authors with conflicts of interest tend to receive more attention in news and social media [12]. As many as half of all news reports manipulate or sensationalise study results to emphasise the benefits of experimental treatments [37].Despite issues with the reliability of health information online, most people trust what they encounter [10,11], and are inconsistent in their efforts to validate health information using appropriate sources [8,11], likely because they find it