In this paper we propose to study the evolution of legal statements found on web sites. Legal statements are an important part of a web site because they can be seen as a contract between the owner of the site and its users. For example, a site's privacy policy explains what kind of data the operator collects from users and how it is processed, while terms of use allow operators to restrict the conduct of users. We describe a research agenda and a methodology for analyzing the evolution of legal statements on the web. The research agenda argues that studying the content of legal statements and how they change over time makes it possible to analyze and understand the evolution of the web from different viewpoints. Specifically, changes in legal statements make it possible to identify emerging legal developments, to expose shifting business objectives, and to track the balance of power between operators and users. Our suggested methodology is to obtain historical snapshots of web sites from the Internet Archive, to group them into different classes, to analyze the content of the legal documents, and to compute metrics such as size and readability scores. The resulting data can then be used to formulate hypotheses about the evolution of certain characteristics of the web. We discuss a pilot study that instantiates our methodology. The study is based on five snapshots of 15 different web sites, and it shows that the methodology is feasible and can generate meaningful results.
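The metric step of the methodology can be illustrated with a minimal Python sketch. It rests on several assumptions that are not taken from the paper: the abstract does not say which readability score is used, so the Flesch Reading Ease formula stands in here; the syllable counter is a crude vowel-group heuristic; and the function names (`wayback_query_url`, `count_syllables`, `document_metrics`) are hypothetical. The URL helper only builds a query for the Internet Archive's public Wayback availability endpoint and performs no network access.

```python
import re
from urllib.parse import urlencode


def wayback_query_url(site: str, timestamp: str) -> str:
    """Build a query for the Wayback Machine availability endpoint.

    `timestamp` uses the archive's YYYYMMDD form; the endpoint reports the
    snapshot closest to that date. Only the URL is built here, not fetched.
    """
    query = urlencode({"url": site, "timestamp": timestamp})
    return "https://archive.org/wayback/available?" + query


def count_syllables(word: str) -> int:
    """Estimate syllables as vowel groups, minus a trailing silent 'e'.

    This is a heuristic (an assumption of this sketch), not a dictionary lookup.
    """
    word = word.lower()
    groups = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and groups > 1:
        groups -= 1
    return max(groups, 1)


def document_metrics(text: str) -> dict:
    """Compute size and readability metrics for one legal statement."""
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    metrics = {
        "chars": len(text),
        "words": len(words),
        "sentences": len(sentences),
        "flesch_reading_ease": None,
    }
    if words and sentences:
        syllables = sum(count_syllables(w) for w in words)
        # Flesch Reading Ease: higher values indicate easier text.
        metrics["flesch_reading_ease"] = (
            206.835
            - 1.015 * (len(words) / len(sentences))
            - 84.6 * (syllables / len(words))
        )
    return metrics


if __name__ == "__main__":
    print(wayback_query_url("example.com", "20100101"))
    print(document_metrics("The cat sat. The dog ran."))
```

Applying `document_metrics` to the same legal statement across several snapshots of a site yields a small time series of size and readability values, which is the kind of data the methodology proposes to compare across classes of sites.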