Shawn Jones scite author profile

Shawn Jones

5Publications

50Citation Statements Received

76Citation Statements Given

How they've been cited

How they cite others

Affiliations

Los Alamos National Laboratory, Old Dominion University, Duke University

Publications

Order By: Most citations

Scholarly Context Adrift: Three out of Four URI References Lead to Changed Content

et al. 2016

View full text Add to dashboard Cite

Increasingly, scholarly articles contain URI references to “web at large” resources including project web sites, scholarly wikis, ontologies, online debates, presentations, blogs, and videos. Authors reference such resources to provide essential context for the research they report on. A reader who visits a web at large resource by following a URI reference in an article, some time after its publication, is led to believe that the resource’s content is representative of what the author originally referenced. However, due to the dynamic nature of the web, that may very well not be the case. We reuse a dataset from a previous study in which several authors of this paper were involved, and investigate to what extent the textual content of web at large resources referenced in a vast collection of Science, Technology, and Medicine (STM) articles published between 1997 and 2012 has remained stable since the publication of the referencing article. We do so in a two-step approach that relies on various well-established similarity measures to compare textual content. In a first step, we use 19 web archives to find snapshots of referenced web at large resources that have textual content that is representative of the state of the resource around the time of publication of the referencing paper. We find that representative snapshots exist for about 30% of all URI references. In a second step, we compare the textual content of representative snapshots with that of their live web counterparts. We find that for over 75% of references the content has drifted away from what it was when referenced. These results raise significant concerns regarding the long term integrity of the web-based scholarly record and call for the deployment of techniques to combat these problems.

show abstract

Persistent URIs Must Be Used To Be Persistent

Sompel

Klein

Jones

2016

View full text Add to dashboard Cite

show abstract

Social Cards Probably Provide For Better Understanding Of Web Archive Collections

Jones

Weigle

Nelson

2019

View full text Add to dashboard Cite

Used by a variety of researchers, web archive collections have become invaluable sources of evidence. If a researcher is presented with a web archive collection that they did not create, how do they know what is inside so that they can use it for their own research? Search engine results and social media links are represented as surrogates, small easily digestible summaries of the underlying page. Search engines and social media have a different focus, and hence produce different surrogates than web archives. Search engine surrogates help a user answer the question "Will this link meet my information need?" Social media surrogates help a user decide "Should I click on this?" Our use case is subtly different. We hypothesize that groups of surrogates together are useful for summarizing a collection. We want to help users answer the question of "What does the underlying collection contain?" But which surrogate should we use? With Mechanical Turk participants, we evaluate six different surrogate types against each other. We find that the type of surrogate does not influence the time to complete the task we presented the participants. Of particular interest are social cards, surrogates typically found on social media, and browser thumbnails, screen captures of web pages rendered in a browser. At p = 0.0569, and p = 0.0770, respectively, we find that social cards and social cards paired side-by-side with browser thumbnails probably provide better collection understanding than the surrogates currently used by the popular Archive-It web archiving platform. We measure user interactions with each surrogate and find that users interact with social cards less than other types. The results of this study have implications for our web archive summarization work, live web curation platforms, social media, and more. CCS CONCEPTS• Information systems → Digital libraries and archives; World Wide Web; Web searching and information discovery; • Applied computing → Digital libraries and archives.

show abstract

Hypercane: Intelligent Sampling for Web Archive Collections

Jones

Weigle

Klein

et al. 2021

View full text Add to dashboard Cite

Improving Collection Understanding for Web Archives with Storytelling: Shining Light Into Dark and Stormy Archives

Jones¹

2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shawn Jones

Scholarly Context Adrift: Three out of Four URI References Lead to Changed Content

Persistent URIs Must Be Used To Be Persistent

Social Cards Probably Provide For Better Understanding Of Web Archive Collections

Hypercane: Intelligent Sampling for Web Archive Collections

Improving Collection Understanding for Web Archives with Storytelling: Shining Light Into Dark and Stormy Archives

Contact Info

Product

Resources

About