In recent years, academia, institutions, and policymakers have been focusing their attention on the impact of data in digital markets. The economic literature that explicitly models data and their collection as strategic variables is growing, but most studies focus on distinct settings with specific data uses. This survey aims to organise this literature to extract general insights that hold across different models and assumptions. To do so, I identify three classes of models according to the way they model data collection. I find that each class is characterised by a specific impact of data on the market outcomes, regardless of the specific data use. First, when firms obtain data without strategic interactions, their use has a pro-competitive effect on the market. However, firms fail to fully internalise the data externalities, leading to data overuse and, in turn, privacy concerns. Second, when firms collect data from their interaction with consumers, data can facilitate market tipping, especially if firms are asymmetric in their starting positions. Third, when firms acquire data from data intermediaries, data are strategically sold to temper competition in the downstream market, allowing intermediaries to extract most of the surplus at the expense of firms and consumers. These general insights can facilitate future research and help policymakers to have a more general understanding of the competitive effects of data, depending on the situation at hand.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.