This paper takes its point of departure in an overview of the overlap problem, and of proposed solutions to that problem. We then look at some analogies between bracketed markup notations and rules for well-formedness and structuring of simple parenthetical expressions. We propose a method for building lattices from marked up documents with and without overlap, and for generating, from these lattices, document models in the form of trees for XML documents, and in the form of GODDAGs for documents with overlap. It turns out that one and the same method can be used for generating both kinds of models, and we argue that lattices can also be used to implement well-formedness constraints for both kinds of documents. Finally, we discuss and compare some of the algebraic features of the document models, and the relations between them.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.