Abstract. This paper shows how an explicit representation of units and quantities can improve the experience of semantically published documents, and provides a first authoring method in this respect. To exemplify the potential and practical advantages of encoding explicit semantics regarding units w.r.t. user experience, we demonstrate a unit system preference service, which enables the user to choose the system of units for the displayed paper. By semantically publishing units, we obtain a basis for a wide range of applications and services such as unknown unit lookup, unit and quantity semantic search and unit and quantity manipulation. Enabling semantic publishing for units is also presented in the context of a large collection of legacy scientific documents (the arXMLiv corpus), where our approach allows to non-invasively enrich legacy publications.
MotivationUnits and quantities 1 , although widely spread, lack a formal standard representation for semantic publishing. A multitude of problems [45] arise from the different flavors (country specific unit standards) and formats (abbreviations, special cases of occurrence) of units, making it hard for the untrained reader to fully understand the information provided. Semantic publishing solves most such problems by disambiguating the unit and quantity occurrences, thus enabling a wide range of applications and services to interact with them.A unit is any determinate quantity, dimension, or magnitude adopted as a basis or standard of measurement for other quantities of the same kind and in terms of which their magnitude is calculated or expressed [32], but from the top-most level of perception, it simply provides information on a wide range of quantifiable aspects. Concrete examples for the great extent of units and quantities include cooking recipes, medical prescriptions, scientific papers and many other. Semantic publishing can provide the middle layer that would ensure an (automated) way of identifying and understanding these occurrences, which can enable the evolution of useful technologies and services.1 We chose the units and quantities wording in order to emphasize the semantic dependence between the unit and its quantity (i.e. amount, magnitude). An in-depth analysis of the two concepts is provided in section 2 of [9].