The DBLP Computer Science Bibliography evolved from an early small experimental Web server to a popular service for the computer science community. Many design decisions and details of the public XML-records behind DBLP never were documented. This paper is a review of the evolution of DBLP. The main perspective is data modeling. In DBLP persons play a central role, our discussion of person names may be applicable to many other data bases. All DBLP data are available for your own experiments. You may either download the complete set, or use a simple XML-based API described in an online appendix.
We demonstrate how the contemporary problems of data acquisition for dblp can be tackled with OXPath. It enables web data extraction and wrapper maintenance for heterogeneous data sources on a simple declarative level. Its features render it a feasible instrument to retrieve the varying and changing web representations of the prototypical substructures in the bibliographic domain. CCS CONCEPTS •Information systems →Digital libraries and archives;
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.